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MOMENT/SUM-OF-SQUARES HIERARCHY FOR COMPLEX 
POLYNOMIAL OPTIMIZATION 

CEDRIC JOSZtS AND DANIEL K. MOLZAHN* 


Abstract. We consider the problem of finding the global optimum of a real-valued complex 
polynomial on a compact set defined by real-valued complex polynomial inequalities. It reduces to 
solving a sequence of complex semidefinite programming relaxations that grow tighter and tighter 
thanks to D’Angelo’s and Putinar’s Positivstellenstatz discovered in 2008. In other words, the 
Lasserre hierarchy may be transposed to complex numbers. We propose an algorithm for exploiting 
sparsity and apply the complex hierarchy to problems with several thousand complex variables. They 
consist in computing optimal power flows in the European high-voltage transmission network. 

Key words. Quillen property, Lasserre hierarchy, Shor relaxation, complex moment problem, 
sparse semidefinite programming, optimal power flow. 
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1. Introduction. Multivariate polynomial optimization where variables and data 
are complex numbers is a non-deterministic polynomial-time hard problem that arises 
in various applications such as electric power systems (Section 4), imaging science [8, 
13,29,66], signal processing [1,6,18,45,48,49], automatic control [70], and quantum 
mechanics [33]. Complex numbers are typically used to model oscillatory phenomena 
which are omnipresent in physical systems. Although complex polynomial optimiza¬ 
tion problems can readily be converted into real polynomial optimization problems, 
efforts have been made to find ad hoc solutions [35,36,67]. We observe that relaxing 
non-convex constraints and converting from complex to real numbers are two non- 
commutative operations. This leads us to transpose to complex numbers Lasserre’s 
moment/sum-of-squares hierarchy [41] for real polynomial optimization. 

In 1968, Quillen [61] showed that a real-valued bihomogenous complex polyno¬ 
mial that is positive away from the origin can be decomposed as a sum of squared 
moduli of holomorphic polynomials when it is multiplied by (\zi\ 2 +.. . + \z n \ 2 ) r for 
some r £ N. The result was rediscovered by Catlin and D’Angelo [17] and ignited a 
search for complex analogues of Hilbert’s seventeenth problem [23,24] and the ensu¬ 
ing Positivstellensatze [26,58-60]. Notably, D’Angelo and Putinar [25] proved in 2008 
that a positive complex polynomial on a sphere intersected by a finite number of poly¬ 
nomial inequality constraints can be decomposed as a weighted sum of the constraints 
where the weights are sums of squared moduli of holomorphic polynomials. Similar 
to Lasserre [41] and Parrilo [56], we use D’Angelo’s and Putinar’s Positivstellensatz 
to construct a complex moment/sum-of-squares hierarchy of semidefinite programs 
to solve complex polynomial optimization problems with compact feasible sets. To 
satisfy the assumption in the Positivstellensatz, we propose to add a slack variable 
z n+ 1 € C and a redundant constraint |zi| 2 +- ■ .+\z n+ i\ 2 = R 2 to the description of the 
feasible set when it is in a ball of radius R. The complex hierarchy is more tractable 
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than the real hierarchy yet produces potentially weaker bounds. Computational ad¬ 
vantages are shown using the optimal power flow problem in electrical engineering. 
In addition to global convergence of the bounds, the complex hierarchy is endowed 
with sufficient conditions for extracting feasible points that are globally optimal. 

The theoretical contributions of this paper regarding the complex hierarchy are: 

1. its construction using real-valued Radon measures (Section 3) leading to a 
new notion of complex moment matrix and localization matrix (Remark 3.1) 
different from existing literature [21]; the Lasserre hierarchy [41] can thus be 
viewed as a special case of the proposed complex hierarchy (Figure 2); 

2. a proof of global convergence (Proposition 3.2, Corollary 3.4); a sufficient con¬ 
dition for strong duality (Proposition 3.10); Karush-Kuhn-Tucker conditions 
involving complex sums-of-squares (Corollary 3.12); a multi-ordered hierarchy 
to exploit sparsity while preserving global convergence (Section 3.7); 

3. a solution to a newly defined truncated complex moment problem (Theo¬ 
rem 3.8) different from existing literature [21, Theorem 5.1] which implies 
Curto and Fialkow’s solution of the real truncated moment problem (Corol¬ 
lary 3.9); as a result, sufficient conditions for extracting global solutions from 
the complex hierarchy (Proposition 3.5); 

4. an invariant complex hierarchy whose convergence can be deduced from an 
invariant version of D’Angelo’s and Putinar’s Positivstellensatz (Proposi¬ 
tion 3.13); in particular, an action of the torus in the complex plane (Propo¬ 
sition 3.14) and a subgroup of it (Proposition 3.15) are considered. 

The paper is organized as follows. Section 2 uses Shor and second-order conic 
relaxations to motivate the complex moment/sum-of-squares hierarchy in Section 3. 
Using a sparsity-exploiting algorithm, numerical experiments on the optimal power 
flow problem are presented in Section 4. Section 5 concludes our work. 

2. Motivation. Let N, N*, R, R+ and C denote the set of natural, positive 
natural, real, non-negative real, and complex numbers respectively. Also, let “i” 
denote the imaginary unit and denote the set of Hermitian matrices of order 
neN*. Consider the subclass of complex polynomial optimization 

(2.1) QCQP-C : inf z H H 0 z s.t. z H H i z^hi 1 i = l,...,m, 

z£ C n 

where m £ N*, H 0 ,..., H m £ H„, h 0 ,..., h m £ R, (•) H denotes the conjugate trans¬ 
pose. The Shor [65] and second-order conic relaxations of QCQP-C share the following 
property: it is better to relax non-convex constraints before converting from complex 
to real numbers rather than to do the two operations in the opposite order. 

2.1. Shor Relaxation. For H £ ]HI n and 2 £ C n , the relationship z H Hz = 
Tr {Hzz h ) holds where Tr (•) denotes the trace 1 of a complex square matrix. Let 0 
indicate positive semidefmiteness. Relaxing the rank of Z = zz H in (2.1) yields 


(2.2a) 

SDP-C : inf 

zeM n 

Tr(H 0 Z) 


(2.2b) 

s.t. 

Tr (HiZ) ^ hi, i = 1,. 

.., m 

(2.2c) 


z 0 , 



1 For all matrices A, B £ C" Xn , Tr(AB) = j< n A ijBji. 
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Let R eZ and Im Z denote the real and imaginary parts of the matrix Z G C" 


respectively. Consider the ring homomorphism A : (C 71 


x) 


2nx2n 


5 +! X ) 


(2.3) 


A (Z) := 


ReZ —Im Z 
ImZ ReZ 


To convert SDP-C into real numbers, real and imaginary parts of the complex ma¬ 
trix variable are identified using two properties: (1) a complex matrix Z is positive 
semidefinite if and only if the real matrix A (Z) is positive semidefinite, and (2) if 
Z u Z 2 G H„, then Tr [A(Zi)A(Z 2 )] = Tr \A{Z X Z 2 )\ = 2Tr(Z 1 Z 2 ). This yields 


(2.4a) 

(2.4b) 

(2.4c) 

(2.4d) 


CSDP-R : inf Tr(A (H 0 )X) 

X(zS2 n 

s.t. Tr(A {Hi)X)^hi, i = l,...,m, 

X *= 0, 

A B T \ A = C, 

B C J & B t = —B, 


X = 


where § 2 n denotes the set of real symmetric matrices of order 2 n and (•) indicates 
the transpose. Note that the set of matrices satisfying (2.4d) is isomorphic to C nxn . 
A global solution to QCQP-C can be retrieved from CSDP-R if and only if rank(X) G 
{0,2} at optimality (proof in Appendix A). In order to convert QCQP-C into real 
numbers, real and imaginary parts of the complex vector variable are identified. This 
is done by considering a new variable x = ( (Rez) T (Im^) T ) and observing that 
if H G H n , then z H Hz = x T A(H)x = Ty(A(H)xx t ). This gives rise to a problem 
which we will call QCQP-R. Relaxing the rank of X = xx T yields 


(2.5a) 

(2.5b) 

(2.5c) 


SDP- 


inf 

xes 2 n 

s.t. 


Tr(A(U 0 )X) 

Tr(A (HJX) < hi, 
X > 0. 


i = 1,..., m, 


A global solution to QCQP-C can be retrieved from SDP-R if and only if rank(X) G 
{0,1} or rank(A) = 2 and (2.4d) holds at optimality. We have val(SDP-C) = 
val(CSDP-R) = val(SDP-R) where “val” is the optimal value of a problem (proof 
in Appendix B). The number of scalar variables of CSDP-R is half that of SDP-R 
due to constraint (2.4d). This constraint also halves the possible ranks of the matrix 
variable, which must be an even integer in CSDP-R whereas it can be any integer 
between 0 and 2 n in SDP-R. The number of variables in SDP-R can be reduced by a 
small fraction ( , 2 ^ +1 to be exact) by setting a diagonal element of A' to 0. This does 
not affect the optimal value (proof in Appendix C). See Figure 1 for a summary. 

2.2. Second-Order Conic Relaxation. In SDP-C of Section 2.1, assume that 
the semidefinite constraint (2.2c) is relaxed to the second-order cones 


( 2 . 6 ) 


Za Z j 

Z" Zj. 


> o , 1 < i ^ j ^ n. 


Equation (2.6) is equivalent to constraining the determinant ZuZjj - ZijZZ and 
diagonal elements Za to be non-negative. This yields SOCP-C : infzeH„ Tr (H 0 Z) s.t. 
(2.2b), \Zij\ 2 ^ ZuZjj for 1 < i j ^ n, and Za > 0 for i = 1,..., n where | -1 denotes 
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QCQP-C 


Z = zz H 

Relax rank Z = 1 


SDP-C 


Identify real and Identify real and 

imaginary parts: imaginary parts: 


/Rez\ 

i 

/ ReZ 




2 

i Im Z 
\ 2 

rLJ 


X = xx T 

QCQP-R--——->- SDP-R ± CSDP-R 

Relax rank X = 1 ™ , , 

Global Global 

solution if solution if 

rank X = 1 rank X = 2 


Fig. 1 . Non-Commutativity of Complex-to-Real Conversion and Relaxation 


the complex modulus. Identifying real and imaginary parts of the matrix variable Z 
leads to CSOCP-R : inf Xe § 2 „ Ti(A(H 0 )X) s.t. (2.4b), (2.4d), Xfj + X* +i>j < X H X 3j 
for 1 ^ i ^ j ^ n, and Xu ^ 0 for i = 1,..., n. In SDP-R of Section 2.1, assume that 
the semidefmite constraint (2.5c) is relaxed to the second-order cones 

(2.7) ( * ij 'Wo , 

This leads to SOCP-R :infxes 2 „ Tr(A(iJ 0 )^Q s.t. (2.5b), Xfj < XuXjj for 1 < i / 
j ^ 2n, and Xu ^ 0 for i = 1,..., 2 n. We have val(SOCP-C) = val(CSOCP-R) ^ 
val(SOCP-R) (proof in Appendix D). The number of variables of CSOCP-R is half that 
of SOCP-R due to constraint (2.4d). The number of second-order conic constraints in 
CSOCP-R, equal to —is roughly a fourth of that in SOCP-R, equal to . 

2.3. Exploiting Sparsity. The properties of chordal graphs enable sparsity 
exploitation for the Shor relaxation [73]. Given an undirected graph (V,£) where 
Vc{l,...,n} and £ C V x V, define for all Z £ H„ 

oo, m {7 \ . f Z v if (bi)e£ or i = j e V, 

(2.8) *(v,£)(S)«-| 0 else . 

We associate an undirected graph Q to QCQP-C whose nodes are {1,..., n} and 
that satisfies Hi = 'kg(Hi) for i = 0,...,m. Let H+ denote the set of positive 
semidefinite Hermitian matrices of size n and let “Ker” denote the kernel of a linear 
application. Given the definition of Q, constraint (2.2c) of SDP-C can be relaxed to 
Z £ + Ker without changing its optimal value for any graph Q whose nodes 

are {l,...,n} and where Q C Q. Consider a chordal extension Q C G ch , that is 
to say that all cycles of length four or more have a chord (edge between two non- 
consecutive nodes of the cycle). Let Ci,...,C p C G ch denote the maximal cliques of 
G ch . (A clique is a subgraph where all nodes are linked to one another. The set of 
maximally sized cliques of a chordal graph can be computed in linear time [68]). A 
chordal extension has a useful property for exploiting sparsity [32]: for all Z £ H n , 
we have that Z £ H+ + Ker Tgch if and only if 'I'c i (Z) 0 for i = 1,... ,p. Note that 

^Ci( z ) £■ 0 if and only if A o ^>g.(Z) 0, where “o” is the composition of functions. 
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Given a graph (V,£), define for X £ S 2n 


(2.9) 


^( V ,£)(^0 


^(V,£){B) ^(V,£){C') ) 1 


using the block decomposition in the left hand part of (2.4d). Notice that Ao\I/( V j £) = 
't(v.f) ° A. As a result, (2.4c) can be replaced by 4'c, ( X) !>= 0 for i = 1 ,... ,p without 
changing the optimal value of CSDP-R, with an analogous replacement for constraint 
(2.5c) in SDP-R. If in SDP-R we exploit the sparsity of matrices A (Hi) instead 
of that of Hi , the resulting graph has twice as many nodes. Computing a chordal 
extension and maximal cliques is hence more costly. Sparsity in the second-order 
conic relaxations is exploited using the fact that applying constraints only for ( i,j ) 
that are edges of Q does not change the optimal values of CSOCP-R and SOCP-R. 

3. Complex Moment/Sum-of-Squares Hierarchy. We transpose [41] from 
real to complex numbers. Let z a denote the monomial z J * 1 • • • where z £ C n and 
a £ N" for some integer n £ N*. Let |a| := aq + •. -+a n and define w as the conjugate 
of w £ C. Define z := (zi,... , z n ) T where z £ C”. Consider the sets where d £ N 


C[z] := { p : C n -> C | p{z) = E| a \^iPaZ a , l £ N, p a £ C }, 
C[z,z]:= { / : C" —>• C | f(z) = E|a|,| 0 |^l fot,gZ a z^, l £ N, f a ,p £ C }, 
1 J R[z, z]~ { / £ C[f, z] | f(z) = f{z ), Vz £ <C" }, 

E[z] := { cr : C" -> C | a = Ej =i bi| 2 , r e N *, Pi e C[z] }> 


= { P : C n C | p(z) = T,\ a KdPa za , Pa £ C }, 

= { / : C" c I f(z) = E\ am<d fa,pz a z?, fcf, £ c }, 

= { / £ C d [z,z] | f(z) = /(z), Vz £ C n }, 

„ lj = { a : C n —>• C | u = i \Pj\ 2 > r e N *, Pi e c d[*] }■ 

Note that the coefficients of a function / £ R[z, z\ satisfy f a ,p = fp, a for all |ck| , \/3\ < l 
for some l £ N. The set of complex polynomials C[z, z] is a C-algebra (i.e. commuta¬ 
tive ring and vector space over C) and the set of holomorphic polynomials C[z] is a 
subalgebra of it (i.e. subspace closed under sum and product). The set of real-valued 
complex polynomials R[z, z] is an R-algebra. The set of sums of squared moduli of 
holomorphic polynomials E[z] and the set E d [z] C Rd[z] are pointed cones (i.e. closed 
under multiplication by elements of R + ) that are convex (i.e. tu + (1 — t)v with 
0 ^ t ^ 1 belongs to them if u and v do). Let C(K, C) denote the Banach (i.e. com¬ 
plete) C-algebra of continuous functions from a compact set K C C n to C equipped 
with the norm ||(/?||oo := sup 2eif |<p(z)|. Consider Rk : C[z, z] —> C(K, C) defined 
by / i —> f\K where f\K denotes the restriction of / to K. Rk(C[z, z]) is a unital 
subalgebra of C(K,C) (i.e. contains multiplicative unit) that separates points of K 
(i.e. u ^ v £ K => 3 ip £ Rk(C[z, z]) : <p(u) ^ <p{v)) and that is closed under com¬ 
plex conjugation. It is hence a dense subalgebra due to the Complex Stone-Weiestrass 
Theorem. Likewise, C(K, R) := {</? £ C(K,C) \ tp{z) = <p(z), Vz £ C"} is a Banach 
R-algebra of which iijf(R[z, z}) is a dense subalgebra. In other words, a continuous 
real-valued function of multiple complex variables can be approximated as close as de¬ 
sired by real-valued complex polynomials when restricted to a compact set. They are 
hence a powerful modeling tool in optimization. Speaking of which, let m £ N* and 
k, k lt ..., k m £ N. Consider (f,g i,..., g m ) £ R k[z, z] x R fel [z,z]x...x R fem [z, z] where 


(3.2) 


^d[z J 
C d[z, z] 
R d [z, z] 
E^lzl 
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there exists \a\ = k and \f3\ ^ k such that f a)( g ^ 0. In addition, for i = 1,..., to, 
there exists |a| = /c, and |/3| ^ hi such that gi^ a ,p ^ 0. Consider the problem 


(3.3) 


f opt ■= inf zeC « f(z) s.t. gi(z) > 0, i = 


where / opt := +oo if the feasible set is empty. The feasible set K := {z G C" | g,;(z) ^ 
0, * = 1,..., m} is assumed to be compact. Let K opt denote the set of optimal solutions 
to (3.3) and M(K) denote the Banach space over R of Radon measures on K. Since 
K is compact, M(K) may be identified with the set of linear continuous applications 
from C(K , R) to R equipped with the operator norm (Riesz Representation Theorem). 
For ip G C(K,C), define f K <pdp := f K Ke(ip)dp T i J K lm(ip)dp [63, 1.31 Definition] 2 . 
Consider the convex pointed cone V{K) := { ip G C(K, R) | ip(z) >0, Vz G K }. A 
Radon measure p is positive (denoted /i ^ 0) if ip G T(K) implies that f K (pdp ^ 0. 
Let M+(K) denote the set of positive Radon measures. We have 


(3.4) / opt = mi^ M{K) f K fdp s.t. S K dp = 1 & p > 0. 

Indeed, if z £ K , then the Dirac 3 measure 5 Z is a feasible point of (3.4) for which the 
objective value is equal to /(z). Hence the optimal value of (3.4) is less than or equal 
to / opt . Conversly, if /r is a feasible point of (3.4), then f K (f — f opt )dp ^ 0 and hence 

Ik f d T > Ik f optd » = /° pt Ik d » = /° pt • 

Proposition 3.1. The set of optimal solutions to (3.4) is 

(3.5) { p G M+(K) | p(K opt ) = 1 & p(K \ K opt ) = 0 }. 


As a consequence, if K opt is a finite set of S G N* points z( 1),..., z(S) G C n , then the 

_^ Cf _^ c< 

optimal solutions to (3.4) are { X)j=i I i Aj = 1 & Ai,..., As G R+ }. 

Proof. Consider p an optimal solution to (3.4). It must be that J K (f — f opt )dp = 
0. Thus I K \ K o P t(f - f opt )dp = 0 and p(K \ K opt ) = I K \ Kop t dp = 0. Therefore 
p(K opt ) = I K o Pt dp = p{K) — p(K \ K opt ) = 1. Conversly, if p belongs to the set 
in (3.5), then it is feasible for (3.4) and I K {f — f opt )dp = I K \ Kop t(f — f opt )dp = 0. 
Hence f K fdp = f K f° pt dp = f° pt f K dp = f° pt . □ 

In order to dualize the equality constraint in (3.4), consider the Lagrange function 
C : M. + {K) x R —> R defined by (p, A) i—> f K fdp + A (l — J K dp). We have 
C(p, A) = A + I K (f — A )dp and 


(3.6) 


inf 

neM+(K) 


j R (f -\)(lp= | 


0 if /(z) — A ^ 0, Vz G K, 
—oo else, 


since, in the second case, we may consider tS~ for a z G I\ such that /(z) — A < 0 and 
t —> Too. This leads to the dual problem 


(3.7) / opt = sup AgR A s.t. /(z) - A T 0, Vz G K. 

Primal problem (3.4) gives rise to the complex moment hierarchy in Section 3.1. Dual 
problem (3.7) gives rise to the complex sum-of-squares hierarchy in Section 3.2. 


2 We wish to thank Bruno Nazaret for bringing this reference to our attention. 

3 The Dirac measure 5 Z with z G K may be identified with the continuous linear application from 
C(K, R) to R defined by ip i —> <p(z). This is one way to interpret the fact that f K fdS z = f{z). 
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3.1. Complex Moment Hierarchy. Let H (respectively H d ) denote the set 
of sequences of complex numbers (ya,p)a,/3eN n (respectively ( 2 / a ,/ 3 )|a|,|/ 3 |^d) such that 
Vad 3 = y/ 3 ,a for all a,/3 £ N” (respectively |a|,|/3| ^ d). An element y £ 'H is 
said to have a representing measure p on K if p £ A4 + (K) and y a t p = J K z a z l3 dp 
for all a,/3 £ N". When y £ 'H has a representing measure on K, the measure 
is unique because Rk{C[z, z}) is dense in C(K. C). The moment problem consists 
in characterizing the sequences that are representable by a measure on K. For ex¬ 
ample, Atzmon [5, Theorem 2.1] proved that when K = {z £ C | \z\ = 1} the 
solutions are the sequences y £ H such that J2 m ,n,j,ke N c n,i Cm.fc y m +j,n+k > 0 and 
Em.ueN w mW n (y m ,n-ym+i,n+i) > 0 for all complex numbers (cj,k)j,k&* and (w n ) neN 
with only finitely many non-zero terms. Theorem 3.7 below generalizes this result. 

Consider a feasible point p of (3.4) and the sequence y £ H that has rep¬ 
resentation measure p on K. Notice that f K fdp = f K Y2\ a \ |/ 3 |^fc fa,pz a z^dp = 

E| a |,| 0 |<fc fa,p J K z a z^dp = E|a|,|^|<fc =■ L y{f) and J K dp = f K z°z°dp = 

yo,o = 1- For all p £ C[z], we have | p\ 2 gi ^ 0 on K. Since p ^ 0, this implies that 
f K \p\ 2 gidp ^ 0. Naturally, we also have J K \p\ 2 godp ^ 0 if we define go := 1. Define 
ko := 0 and d mm := max{fc, k\ ..., k m }- Consider d > d min , 0 < i < m, and p £ 
Cd-fcjz]. We have J K \p\ 2 gi dp = f K \Y^\ a \^d-k i Pc‘Z: a \ 2 {T l \j\,\s\^k i 9i,'y,sz 7 z s )dp = 
Ik (Elal.l/JKd-fc* P«.P/3^)(E| 7 |,|,5|<fe g l , 1 ,SZ^Z 5 )dp = f K E|a|,| P\<d-ki PaPp 

T lh \,\S\^k i 9in,SZ a+1 ZP +5 dp = E w ,1,,^ PaPfi E| 7 |,|^ fci 9i^ f K Z^Z^dp = 

ElaMPKd-fc PaPP E| 7 |,|^ fci 5i,7,<5 2/a+ 7 ,/3+<5 = 1 E|o:|,|^|<d-fci PaPpM d - ki (giy){a i (3) = 
P H M d _ ki (giy)p where p := (p a )\ a \^ d - ki and M d _ ki (g z y) is a Hermitian matrix in¬ 
dexed by | a |, |/31 ^ d — hi. To sum up, y is a feasible point of 

P ■= inf yen L y (f) 

(3.8) s.t. ?/o,o = 1, 

M d - ki ( g l y ) *= 0, i = 0,..., m, VO d min , 

with same objective value as p in (3.4). Automatically, p ^ / opt - Consider the 
relaxation of (3.8) defined by 



p d := inf y GHd L y (f) 


(3.9) 

s.t. 2/o,o = 1, 



M d - ki {g i y)'>pf), i = 0,. 

■>m, 


which we name the complex moment relaxation of order d for reasons that will become 
clear with Theorem 3.7. In Section 3.2, we will introduce its dual counterpart. 

Remark 3.1. Given y £ R, the function L y in this section can be formally be 
defined by the C-linear operator L y : C[5, z] —> C such that L y (z a z&) = y ai p for 
all a,/3 £ N (i.e. Riesz functional). If <p £ C[z, z] and Tp = <p, then L y (tp) = L y (<p). 
Given l, d £ N and tp £ K; [z, z], the matrix M d (tpy) can be formally be defined as the 
Hermitian matrix indexed by |a|, |/3| < d such that M d (tpy)(a, f3) := L y {tp(z)z a z /3 ) = 
E| 7 |,|i|<j ¥> 7 ,<s Va+j, 0 +s- Notice that M d (tpy)(0,0) = L y {tp). Lastly, define M d (y) := 
M d {goy) which we refer to as the complex moment matrix of order d. 

3.2. Complex Sum-of-Squares Hierarchy. Given l £ N and tp £ R/[f, z], 

define tp := (p a ,p)\a\,\p\^l- This notation is well-defined due to the unicity of the 
coefficients of p. A Notice that tp £ E;[*] if and only if tp )>= 0. Also, define (A, B)-u d ’■= 

4 The notation is ill-defined in the real case: if tp : x G R n —> E| a | \p\<l £ R, then 

the coefficients G R are not unique. Thus S| a |< 2 Z a oiX a is a real sum of squares if and only if 
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Tr(AS) where A, B £ Bd- Given d ^ d min , consider the Lagrange function Cd ■ 
B d xRxT, d _ ko [z\x.. .xE d - km [z\ — > R defined by {y, A, a 0 ,... , a m ) < — *• L y {f)+X( 1- 

yo,o)-EHo( M rf-fc(5iy).o ! i}?t d - fci - Given cq =: EjLi \Pj\ 2 , i-e. = E£=i Vj % {pj % ) H ■> 
compute C d (y, A, er 0 ,..., a m ) = A + L y (f - A) - EEo E^=o(Pj ) H M d-k t (9iyWj = 
A + T y (/ - A) - E™ 0 EjLo L y(bj| 2 3i) = A + L y (/ - A - E™ 0 a igi ). Observe that 

( rn \ ( 0 if f(z) - A - EEo Vi( z )9i(z) = 0, 

(3.10) inf L y I / — A — cq<ii 1 = < for all z £ C", 

!/e ^ V i=o / [ — oo else. 

Indeed, in the second case, there exists z £ C n such that f(z)—X—J2iLo a i( z )9i( z ) ^ 0- 
With (y a ,p) a ,p&i := ( z a z^) a ,/3eN; Lt y (f A ^E=o ®i9i) t oo for either t ^ oo 
or t —> +oo. The associated dual problem of (3.9) is thus 



Pd ■= SU PA, ff 

A 


(3.11) 

S.t. 

/ - A = El=o a i9i , 




A £ R, <Ji £ Yi d —ki [z\, i — 0,.. 

. ,m, 


which we name the complex sum-of-squares relaxation of order d. Consider 
P* ~ sup Ao . A 

(3.12) s.t. / - A = E;=o a i9i, 

Ael, ai £ £[z], i = 0,..., to. 

Proposition 3.2. We have p* d ^ p d for all d > d min and p* d —*• p* ^ p < / opt . 
Proof. The sequence (pj)d^d min is non-decreasing and upper bounded by p* £ 
RU{±oo}. Thus it converges towards some limit p* im £ RU{±oo} such that p* im < p*. 
If p* = — oo, then p* d = — oo for all d ^ d rnln and p d —> p*. If not, by definiton of 
the optimum p*, there exists a sequence (X 1 , a l 0 , ..., a l m ) of feasible points such that 
X 1 < p* and X 1 —> p*. To each l £ N, we may associate an integer d(l) £ N such 
that (X 1 , ctq, ..., a l m ) is a feasible point of the complex sum-of-squares relaxation of 
order d(l). Thus A ; < p d ^ < p*. As a result, p* imit = p*. Moreover, (p d ) d ^ d ™ in is 
non-decreasing and upper bounded by p £ R U {±oo}. Thus it converges towards 
some limit pu m £lU {±oo} such that pu m ^ p. Moreover, weak duality implies that 
P d ^ Pd, (^ p)- Thus p* < piim ^ p. It was shown in Section 3.1 that p ^ / opt . □ 
Remark 3.2. Problems (3.12) and (3.8) may be interpreted as a pair of primal- 
dual linear programs in infinite-dimensional spaces [4]. Consider the duality bracket 
(.,.) defined from R[z, z] x PL to R by {p, y) := L y (p). A sequence {p n ) n £N in R[z, z \ is 
said to converge weakly towards p £ R[z, z] if for all y € B, we have {p n , y) — > (p, y ). 
Consider the weakly continuous R-linear operator A : R[z, z} —> R[5, z] defined by 
p i —> p — Po,o- Its dual A* : B —> B is defined by y i —> y — y o,o<Vo where 
(<Vo)o,o = 1 and (<Vo)a,/3 = 0 if ( a >^) £ (°> 0 )- Indeed, (Ap,y) = (p,A*y) for all 
(p, y) £ R[z, z] x B. Consider the convex pointed cone defined by C := £[z]go + ... + 
T,[z\g m and its dual cone C* := {y £ B \ Vp £ C, (p, y) ^ 0}. If b := Af , then 

foio-v fo,o-p* = inf v eR[f,d (¥>><W s.t. Ap = b & p £ C, 

fo,a-p = sup y&n ( b,y) s.t. do.o — A*y £ C*. 


there exists some real numbers (^a,j9)|a|,|j0|<J 0 such that X]|q :|<22 = 5D| a | \/3\<l Vol,P xOLx ^• 
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Let cl(C) denote the weak closure of C in R[z, z], [2, 5.91 Bipolar Theorem ] 5 implies 
that cl(C) = C**. Below, Theorem 3.3 and Theorem 3.7 provide a sufficient condition 
ensuring no duality gap in (3.13) and cl(C) = {tp £ M[z, z] \ <p \k ^ 0} respectively. 

3.3. Convergence of the Complex Hierarchy. We turn our attention to a 
result from algebraic geometry discovered in 2008. 

Theorem 3.3 (D’Angelo’s and Putinar’s Positivstellenstatz [25]). If one of the 
constraints that define K is a sphere constraint \zi \ 2 + . • • + \z n \ 2 = 1, and if f\K > 0. 
then there exists cq, ■ ■ ■, c m £ £[z] such that f = Y^iLo a i 9 i- 

Proof D’Angelo and Putinar wrote the theorem slightly differently. Say that 
constraints g m -1 and g m are such that g m -1 = s and g m = — s where s(z) := 1 — 
|zi | 2 — ... — |z n | 2 . With the assumptions of Theorem 3.3, the authors of [25, Theorem 
3.1] show that there exists ..., a m -2 £ £[z] and r £ R[z, z] such that f(z) = 
YYiYo 2 a i( z )9i( z ) + r(z)s(z) for all z £ C”. Thanks to [24, Proposition 1.2], there 
exists Cm—i , c m . £ £[z] such that r = c m -i — <x m hence the desired result. 0 

Theorem 3.3 can easily be generalized to any sphere |zi | 2 + ... + |z n | 2 = R 2 of ra¬ 
dius R > 0. With scaled variable w = £ C", the sphere constraint has radius 1 and 

a monomial of (3.3) with coefficient c a ,p £ C reads c a ,pz a z^ = c a ,p(Rw) a (RwY = 
R\ a \+\P\ Ca i(S wJ“wA With the scaled coefficients i?H+l/3| Ca ^ ; Theorem 3.3 can then 
be applied. Reverting back to the old scale z = Rw leads to the desired result. 
Accordingly, we define the following statement which is true only when stated: 


(3.14) 


Sphere Assumption: 


One of the constraints of (3.3) is a sphere 
|zi| 2 + ... + |z„| 2 = R 2 for some R > 0. 


Corollary 3.4. Under the sphere assumption (3.14), p* d f opt and pd —> f opt . 

Proof. Theorem 3.3 implies that p* = / opt because for all e > 0, function 
/ — (/° pt — e) is positive on K. The sequences {p* d )d^d™™ and (pd)d^d min converge 
towards / opt due to Proposition 3.2. □ 

To require a sphere constraint in a complex polynomial optimization problem 
seems very restrictive and irrelevant for many problems. But in fact, a sphere con¬ 
straint can be applied to any complex polynomial optimization problem (3.3) with a 
feasible set contained in a ball |zi | 2 +... + |z n | 2 ^ R 2 of known radius R > 0. Indeed, 
simply add a slack variable z n+ ± £ C and the constraint |zi | 2 + .. . + |z„ + i | 2 = R 2 . Let 
I\ denote the feasible set of the problem in n-f-1 variables. If (zi,..., z n+ 1 ) £ K, then 
(z±, ..., z n ) £ K and has the same objective value. Conversly, if (zi, ..., z n ) £ K, 
then (zi ,..., Zn+i) £ K for all z n+ ± £ C such that |z n+ i | 2 = R 2 — \zi \ 2 ... — |z„| 2 . 
Again, the objective value is unchanged. To ensure a bijection between K and K , add 
yet two more constraints iz„+i — iz n +i = 0 and z n .|_i + z n +i ^ 0 , thereby preserving 
the number of global solutions. In that case, the application from K to K defined by 
(zi,..., z n ) i —> (zi,..., z„, sjR 2 — |Zi | 2 — ... — |z n | 2 ) is a bijection. Adding the two 
extra constraints is optional and not required for convergence of optimal values. 

As seen in Theorem 3.3, an equality constraint may be enforced via two opposite 
inequality constraints. Let hi,... ,h e denote e £ N* equality constraints in polynomial 
optimization problem (3.3). Putinar and Scheiderer [59, Propositions 6.6 and 3.2 (iii)] 
show that the sphere assumption in D’Angelo’s and Putinar’s Positivstellensatz may 


5 We wish to thank Jean-Bernard Baillon for bringing this reference to our attention. 
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be weakened to the existence of ry,..., r e G M[z, z], a G £[z], and agM such that 

e n 

(3.15) y= y; | 2 j| 2 + a(z) + a, Vz £ C n . 

j =i »=i 

If the constraints include |^i | 2 — 1 = ... = \z n \ 2 — 1 = 0, the assumption is satisfied by 
ri = ...=r n = l,CT = 0 and a = —n. In particular, there is no need to add a slack 
variable in the non-bipartite Grothendieck problem over the complex numbers [8]. 
Example 3.1. D’Angelo and Putinar [25] consider -| < a < | and problem 


(3.16) 


infzec f(z) ■= 1 - |l-| 2 + a \ z 

s.t. g(z) := 1 - \z\ 2 > 0, 


whose set of global solutions is A' opt = {z g C | \z\ = 1} and / opt = a — £ > 0. They 
prove that the decomposition f = <j q + ay g (<7o, ay G £[ 2 ]) of Theorem 3.3 does not 
hold. As a result, the optimal values of the complex sum-of-squares relaxations cannot 
exceed 0 even though / opt > 0. Indeed, if p* d > 0 for some order d ^ d min , then there 
exists A ^ ^ and ao, oy € £<*[ 2 ] such that / — A = oo + <Jig. Thus / = A + do + oy <7 
where A + oq G £d[z], which is a contradiction. We suggest solving 


(3.17) 


iof 2l ,2 2 eC 

s.t. 


f{zi,z 2 ) ■■= 1 - ||^i| z + a\zi\ 

g(zi,z 2 ) ■= 1 — |^i| 2 — l~ 2 | 2 = 0. 


For all A < / opt , there exists do G £[ 21 , 22 ] and r G R[z±, z 2 , Zi, z 2 ] such that 
f(z!,z 2 ) - A = ao(z!,z 2 ) + r(zi,z 2 )g(zi,z 2 ) for all zi,z 2 G C. Plug in Zi = z 
and z 2 = 0 and obtain f(z ) — A = < 70 ( 2 ,0) + r(z,0)g(z) for all z g C. While func¬ 
tion z 1 —> < 70 ( 2 , 0) belongs to £[ 2 ], function 2 1 —> r(z, 0) does not! Hence we do 
not contradict the fact that f = a 0 + ay g (ao,ay G £[ 2 ]) is impossible. Consider 
a = + |) = so that /° pt = Js- Notice that d min = 2 for (3.16) and 

(3.17). The complex relaxations of orders 2 ^ d ^ 3 of (3.16) yield 6 the value 
—0.3333. The complex relaxation of order 2 of (3.17) yields the value 0.0556 (« / opt ) 
and optimal polynomials < 70 ( 21 , 22 ) = 0.2780\z 2 \ 2 + 0.2776\ziz 2 \ 2 + 0.6667|22| 4 and 
r( 2 i, 22 ) = 0.9444 - 0.3889|2i| 2 + 0.6665|2 2 | 2 . 

Proposition 3.5. Assume that the sphere assumption (3.14) holds, that n > 1, 
and that y G hid is an optimal solution to the complex moment relaxation of order 
d ^ d mm . With dx '■= maxi^;^ m Ay (Ay is defined above (3.3)) and d mm < i < d, if 

1. rank M t (y) = rank M t -d K {y) (=■ S), 

( M t _ dK (y) M t -d K (ziy ) M t - dK (zjy) \ 

2. M t - dK (ziy ) M t -d K {\zi\ 2 y) M t - dK (zjZiy) > 0, for all 1 < i < j ^ n, 

\M t .- dK (z j y) M t _ dK (ziZjy) M t _ dK (\z j \ 2 y)J 

then p d = f opt and complex polynomial problem (3.3) has at least S global solutions. 

Proof. Thanks to Theorem 3.8 below, y G Ht can be represented by a measure p 
on I\ (i.e. y a< p = f K z a z^dp, V|a|, |/3| ^ t) and can thus be extended to y G %. The 
same theorem implies that p = ’Yhj-i for some S different point 2 ( 1 ),..., z(S) 

in K and some Ai,...,As > 0. In addition, yo,o = J K z°z°dp = ^f=i Ay = 1 and 


thus /° pt > Pd = L y {f) = J K fdp = E-=iA jf(zU)) > E,=iA,/ opt = / opt . We 
simultaneously deduce that p d = f opt = f(z( 1)) = ... = f(z(S)). □ 


®MATLAB 2013a, YALMIP 2015.06.26 [46], and MOSEK are used for the numerical experiments. 
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In particular, if S = 1 in Proposition 3.5, then Point 2 in Proposition 3.5 need not 
be checked for (see comment under (3.25)) and y aJ 3 = f K z a z^d5 z = z a z ^, V|a|, |/3| < 
d min for some 2 G A opt . A global solution can be read from y because 2 = ( 2 /o,/ 3 )|/ 3 |=i- 
Example 3.2. Putinar and Scheiderer [60] consider parameters 0 < a < - and 
C > , and problem 


(3.18) 


infzec f(z) := C - | 2| 2 

s.t. <7(2 ) := 121 2 — az 2 — az 2 — 1 = 0, 


whose set of global solutions is A' opt = j+ -^== | and / opt = C — \ 2a > 0. They 

prove that the decomposition of Theorem 3.3 does not hold. Since the feasible set is 
included in the Euclidean ball of radius y/C, we suggest solving 


(3.19) 


infzi.zjec f{zi,z 2 ) 
s.t. gi(z!,z 2 ) 

92(zi,z 2 ) 
93(zi,z 2 ) 
9i{zi,z 2 ) 


C-|2r| 2 

1 21 1 2 — azf — az 2 — 1 = 0, 

c'-N 2 -M 2 = o, 

i 2 2 — iz 2 = 0, 
z 2 + 2 2 ^ 0. 


Consider a = \ and C = 3 so that / opt = 1. Notice that d min = 2 for (3.18) and (3.19). 
The complex relaxations of orders 2 ^ d ^ 3 of (3.18) are unbounded. The complex 
relaxation of order 2 of (3.19) yields the value 0.6813. That of order 3 yields 1.0000, 
rank Ms(y) = rank M\(y) = 2, and Point 2 in Proposition 3.5. Thus / opt ss 1.000 
and there exists at least 2 global solutions to (3.19), and hence to (3.18). 

We next transpose [38, Lemma 3] from real to complex numbers. 

Lemma 3.6. Define s(z ) := R 2 — | 2 i| 2 — ... — |2 n | 2 . Given d G N* and y G TLd , if 
M d (y) 0 and M d _i(sy) = 0, then Tr (M d (y)) < 2 / 0,0 J2i=o r2 ‘- 

Proof Given 1 < l < d, we have Tr(Mj_i(sy)) = Mi-\(sy){a, a ) = 

2|a|<Z-l L y (s( z )z a Z a ) = X}|a|<2-1 5J| 7 |<1 S 7 , 7 2/ 7 +a, 7 +a = S|a|<Z-l,| 7 |=0 S 7>72/7+«>7+a 
+ X/|a|<Z-l,| 7 |=l s 7 ,lV 7 +a, 7 +a = S|a|<£-1 ^ 2/a,a ~ Yh |a| <Z-1,| 7 |=1 2/ 7 +a, 7 +a- We have 
M d ~i(sy) = 0 so Mi-i(sy) = 0 for all 1 ^ < d and hence Tr(M;_i(sy)) = 0. 

In addition, X/ 0 <|a|^Z 2/a,a ^ Xqa|^Z— 1,| 7 |=1 2/ 7 +o;, 7 +c«- Thus X/|a|$;z 2/a,a ^ 2/0,0 + 
A 2 ]C|a|sg;-i 2 la,a for 1 ^ Z ^ d, which proves the lemma. □ 

Theorem 3.7 (Putinar and Scheiderer [59]). Under assumption (3.14), y G U 
has a representing measure on K if and only if M d {giy) ^=0, i = 0,..., to, Vd G N. 

Proof. We provide an alternative proof using Lemma 3.6. The “only if” part is 
a consequence of Section 3.1. Concerning the “if” part, if 3 / 0,0 = 0, then Lemma 
3.6 implies that y = 0 which can be represented by /r = 0 on A'. Otherwise 
3 / 0,0 > 0 and 2 // 2 /o,o is a feasible point of problem (3.8) whose optimal value is 
f°pt f or gji j g R[f, z \ according to Corollary 3.4. If moreover f\ K ^ 0, then 
L v/vo.o (/) > /° pt > In particular, if f\ K = 0, then L y/voo (f) = 0. We may there¬ 
fore define L y/yoo : R K (C[z, 2 ]) —*■ C such that L y/yo o (<p\ K ) := L y/yoo (ip) (similar to 
Schweighofer [64, Proof of Theorem 2]). If ip G J?ic(K[f, 2 ]), then L y / yo o (Halloo — p) ^ 

0 and L y / yoo (ip) < |M|oo- Linearity implies that \L y / yoo (ip)\ ^ |M|oo- As a re¬ 
sult, for all <p G i?if(C[ 2 , 2 ]), we have \L y / yoo (ip)\ = \L y/yoo (Re(ip) + ilm(<p))| = 

l^'!//z/o,o(^' e ( < / :3 ))+^y/'!/o,o(^ m (+))l ^ l^'a/j/o,o(^' e ( < / : ’))l + l^'y/yo,o(^ m (v 5 ))l ^ ||R- e (+)||oo + 

||Im(yj)|| 00 < 2||(/?|| 00 . Moreover, Rk{C[z,z]) is dense in C(A,C). Therefore L y / y 
may be extended to a continous linear functional on C(A, C) (we preserve the same 
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name for the extension). K is compact thus the Riesz Representation Theorem im¬ 
plies that there exists a unique Radon measure p such that L y / yo 0 (<p) = f K pdp for 
all p G C(K, C) and p > 0 because <p G 'P(K) implies that L y / yoo (p) ^ 0 (density 
argument). Finally, if a,/3 G N", y a ,p/yo,o = L y / yo o (z a z^) (Remark 3.1) so y has 
representing measure yopp on K. □ 

Theorem 3.8. Let n > 1 and y G Tl d with d ^ dx = maxi^j^ m ki (ki is defined 
above (3.3)). Assume that K contains the constraints \zk \ 2 ^ R 2 , k = 1 ...n, for 
some radii Rk ^ 0 or the constraint Y^k=i |~fc| 2 = R 2 for some radius R > 0 ( where 
^ is an equality or an inequality). Then there exists a positive r&nkM d - dK (y)-atomic 
measure p supported on K such that: 

(3.20) Va ,0 = [ z a z^dp , for all |a|,|/3|<d 

J c n 


if and only if: 

1. M d {y) J?= 0 and M d -ki{9iy) > 0, i = 1.. .m; 

2. vankM d (y) = rankM d _ djf (?/); 

/ M d _ dK (y) M d _ dK (ziy) M d _ dK (zjy) \ 

3. ( M d — d-K (■ ZiV) M d —cLk (I Zi\ 2 y) M d — d,K i Z j Ziy) > o, VI < i < j < n. 

\M d - dK (zjy) M d _ dK (ziZjy) M d _ dK (\zj\ 2 y)) 

Moreover, for each 1 ^ i ^ m, the measure p has exactly i'&nkM d (y)—i'&nkM d _ dK (giy) 
atoms that are zeros of gi. 

Proof. (•<=) Point 1 implies that (2/ a ,/3)|a|,|/3|^d <?= 0. Thus there exists a com¬ 
plex matrix x of the same size as (y a ,p)\ a \,\p\^d such that we have the Cholesky 
factorization (y a ,/ 3 )|a|,|/ 3 |^d = x H x. Let (:Ea)|a|<d denote the columns of x. Also, 
let C d denote the column space and consider the inner product ( u,v)c d := u H v 
and its induced norm ||.||c d - We have y a = (x a ,xp)c d for all |ck|,|/?| < d. Let 
V := span(a; Q .)| Q ,|^ ( i C C d . Point 2 implies that V = span(a; ct )| Q ,|^ ( i_ 1 . Given 
1 ^ k ^ n, define the C-linear operator Tk : V —> V such that TkX a = x a + ek 
for all |a| ^ d — 1 where e*, is the row vector of size n that contains only zeros apart 
from 1 in position k. This shift operator is well defined because each element of V has 
a unique image by Tk- Indeed, consider some complex numbers (u a )| a |^d-i- The as¬ 
sumption on K in the case of multiple constraints and Point 1 imply that M d _i[(R 2 — 
l z fc| 2 )y] 0- Thus || E|a|<d-1 u a x a+e k |lc d = E |a|, |/3| ^d-l i X a+e k , £/3+e fe )c d 'h a U 0 = 

E|c(|,|/3|<d-l Da+e k ,P+e k UaUp ^ R 2 E|a|,|£|<d-1 Va,pU a Up = R 2 E |a| ,|/3| ^d-1 ( Xa J X d)Cd 
u a up = -R^H E| a |<d-i M “ a '“llc d - Thus Tk is well-defined and bounded by Rk- The 
assumption on I\ in the case of a single constraint and Point 1 imply that M d _i[(R 2 — 

E"=i \zj\ 2 )y] > o. Thus II J2 |a|<d-l u oc x a+e k llc d < E"=i HE, c*|<d-l u aX a +ej |lc d ^ 
i? 2 1| E| a |<d-i u aX a \\c d - Hence Tk is well-defined and bounded by R. 

Clearly, (Tj,..., T n ) is a pair-wise commuting tuple of operators on V. Let’s now 
prove that (T*,... T*, Ti, ..., T n ) is a pair-wise commuting tuple of operators, which 
reduces to showing that T*Tj — TjT* = 0 for all 1 < i < j ^ n (where (•)* stands 
for adjoint). To do so, consider 1 Sj i < j Sj n and u,v,w G V. Point 2 implies 
that V = vec(x a )\ a \^ d _ dK . Thus there exists some complex numbers {u a )\ a \^d-dK> 
(l J ct)|a|^d—dft:; and (^a)|ct|^d—dx SUCh that U E|a|^d—djt ^ aXa 7 ^ E|a|^d— 

and w = E|a|^d-dx w a x a . Given k G N and <p G Ck[z,z], notice that (u, ip(T)v)c d = 

E| 7 |,|tf|<fc T 7 ,s( T ' y ^ T Mc d = E|a|,|/9|<d-d*: E| 7 |,|tf|<fc ^,s(T^X a ,T S Xp) Cd U a Vp = 

E|a|,|/3|^d-dx E H)H < fe < fm,s{ x a+'y, Xp +d )c d U a Vp = . . . 
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^ |a| ,|/3|^d— dx (^ 
(3.21) 


v|7|,|5|<fc VliSVa 

t -\- 7 , (3+S ) ^ a V ft 

/ ( U \ 

(i 

rji* 

i 

rp* 

1 i 

(M 

A 

T*Ti T*Ti 

\ W 

\Tj 

T*Tj 

TjTj 


-!H, 



Cd X Cd X Cd 



H 

/ u\ 

, / 

1 - 

f 

v 



1 \ 

\w) 

\ 



Md—d K (y') Md-d K {Ziy) Md—dK^jV) 
Md 

—d,K ( Ziy ) Md — (Lk ( \zi\ 2 y ) M d —dK {zjZiy) 
M d —dx { z jy ) Md—dx ( z i z j y ) Md~ <lk(\ zj\ y) 






which is equivalent to the fact that Schur complement satisfies 


(3.24) 


(T*Ti T/TA 

\t;t 3 t;t 3 ) 


(T t T* 

\t 3 t* 


TiT*\ 
T jTj ) 


> 0. 


Thus T*Ti — T{T* > 0 and T*Tj — T 3 T* 0. Since their trace is zero, we in fact have 
that T*Ti — TjT* = 0 and T*T 3 — TjT* = 0. Going back to the Schur complement 
(3.24), we thus have T*Tj — TjT* = 0. 

Having proven that (T*,... T*, Tj,..., T n ) is a pair-wise commuting tuple of op¬ 
erators, it follows that they are commonly diagonizable. In other words, there exists 
orthogonal projectors E \,..., E p of V such that EjEj = 0 for all 1 < i ^ j < p and 
there exists some complex numbers (Aj)i<i<n such that T j. = Y^j=i ^k,j E j for all 
1 ^ k < n (and thus = YTj=i A k,j E j ). For all |a|, |/?| ^ d, we thus have y a ^ = 
{x a ,xp) Cd = {T a xo,T p xo)c d = (x 0 , (T*) a T^x 0 )c d = {xa,YJj=i ^j X j E j x o)c d = 

X)j =1 A “\j (xo, EjXo)c d - Naturally, the number of projectors satisfies p ^ dim(F) = 
rankMd(y). Conversly, rankM d (y) = rank(y ai/ 3 )H,|/ 3 |<d < p. Hence p = rankM d (y), 
the elements Ai,...,A p are all distinct, and ( xo,EjXo)c d > 0 for all 1 ^ j ^ p. 
Thus p := ]Tj=i( x o, E j x o)c d Sx 3 is a positive rankMrf_d K (y)-atomic measure that 
satisfies (3.20). Given 1 < j < p, let’s show that \ 3 := (Xk,j)i^k^n G K. There 
exists u £ V \ {0} such that T k u = A k ,jU for all 1 ^ k ^ n. Normality im¬ 
plies that T£u = \k,jU. Hence ( T*) a T^u = XJXjU for all a,/3 £ N. As a result, 
9 i{Xj)\\u\\ 2 Cd = (u,gi(Xj)u) Cd = (u,g l (T)u) Cd = u H M d _ dK {g iy )u > 0. Thus X 3 £ K. 

(=>) Let p := i'&nkM d _ dK (y), and let (Aj)i<.j<p and (rrij)i^j^p denote the dis¬ 
tinct atoms and their positive weights respectively. Let x a := (^/fnjX'j £ C p for 

all |a| ^ d and V := vec(:r a )|a|^;d- With these notations, we have y at p = (x a ,xp)c d 
for all |a|, |/3| ^ d. Notice that p = rank M d - dK (y) < ra,nkM d (y) = dimF < p, thus 
Point 2 holds. Given 1 < k < n, let T k := diag(Afc i i,..., Xk, P )- It satisfies the shift 
property TkX a = x a + ek for all |a| ^ d — 1 and = diag(Afc.i,..., A k, P )- Moreover, 
the shifts and their adjoints are pair-wise commuting so (3.24) holds and thus Point 
3 does too. Let’s now prove Point 1. Consider 1 ^ * St m and some complex num¬ 
bers (ita)|a|sjd—fci =: u. Let u := J2\ a \^ d -ki U » x a and u ='■ ( u j)i^j^p S C p . We 
have u H M d _ ki {g iy )u = {u,gi(T)u) Cd = (u,[gi(X 3 )u 3 ] p j= 1 )c d = Y%=i 9 i(Xj)\uj \ 2 ^ 0. 
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Hence dim Ker gi(T) (= p — rank gt{T) due to the rank-nullity theorem) is equal 
to number of atoms that are zeros of gi. To conclude, notice that rank g,(T) = 
rank {{x cn g i {T)xp)c d )\ a \,\p\^d-d K = rankM d _ dic (giy). □ 

In the univariate case n = 1, Theorem 3.8 holds when Point 3 is replaced by 


(3.25) 


( M d - dK (y) M d -d K {zy) \ . n 

\M d - dK (zy) M d _ dK \\z\ 2 y)J 


In Theorem 3.8, if we assume that j/o,o > 0, then Point 2 and Point 3 may be replaced 
by rankMd(y) = 1. Indeed, in that case, the shift operators act on a one dimensional 
space, so they and their adjoints must commute pair-wise. For previous work on 
the link between linear functionals that are nonnegative on a quadratic module and 
bounded operators that admit a cyclic vector, see [57] and [22, Theorem 2.3]. 

Corollary 3.9. Let y G H d be a Hankel matrix (i.e. y a ,p = y^,s for all 
|a|, |/?|, |'y|, |(5| ^ d such thata+fd = j+6). Then there exists apositive ra,n\tM d _ dK (y)- 
atomic measure g supported on K such that: 

(3.26) y at p = [ z a z^dg , for all |a|, \/3\ < d 

J C" 

if and only if: 

1. M d (y) ^ 0 and M d _ dK (giy) ^ 0, i = 1... m; 

2. rankM d (j/) = r&nkM d - dK (y). 

Moreover, for each 1 ^ ^ to, the measure g has exactly ia,nkM d (y)—ia,nkM d - dK (giy) 

atoms that are zeros of gi. 

Proof. (=>) Same as in proof of Theorem 3.8. (<==) The Hankel property im¬ 
plies that the shifts in the proof of Theorem 3.8 are well-defined and self-adjoint. 
Indeed, consider 1 ^ k ^ n and u,v G V. According the Point 2, there exists 
some complex numbers (u a )\ a \^ d _i and (r’a)|a|^d-i such that u = Xq a |<d-i u a.x a 
and v = Y / \ a \^d-i v » x a- Sial^d-i^A^a = 0, then for all |/3| < d- 1, we have 
(S|a|<d -1 u aXa+e k , Xp)c d = S|o|<d-l ^a(x a +e k , x p)c d = 5Z|a|<d-l ^a2/a+e fc ,/3 = 
S|a|<d -1 ^a2/a,/3+efc = (S|a|<d -1 u ot x a, x P+e k )c d = 0, and hence X]|a|^d -1 u ot x a+e k = 
0. Tfc is thus well defined. Moreover, we have T£ = because ( TkU,v)c d = 
(Sq u a x a+e k , Yla v a x a)c d ~ Sa,/3 j x p)c d = U a Vpy CL + eki p = 

Ea,/3 Ra^/32/a,/3+efe = ( U,Ti~v)c d ■ HI 

A Hermitian matrix that is a Hankel matrix is real symmetric. Hence the atoms 
in Corollary 3.9 lie in K Corollary 3.9 is thus the same as [42, Theorem 3.11] 

due to Curto and Fialkow [21, Theorem 1.1], This observation leads to Figure 2. 

Next we transpose [38, Theorem 1] from real to complex numbers. 

Proposition 3.10. Under assumption (3.14), p* d = p d G RU{+oo},Vd ^ d mm . 

Proof. Given A G Hd, consider the operator norm ||A||, the largest eigenvalue 
of A in absolute value, and the Frobenius norm ||A||f := i/( A , A)u d - Consider 
d ^ d mm . Two cases can occur. Case 1: the feasible set of the complex moment 
relaxation of order d is non-empty. All norms are equivalent in finite dimension so 
there exists a constant C d G IR. such that for all feasible points ( 2 / a ,/ 3 )|ct|,|/ 3 |^d we 

have ijYf\ a \m^d\y<*A 2 = \\ M d(y)h < Cd \\M d {y)\\ ^ C d Ef=Q i?2/ according to 
Lemma 3.6. As a result, the feasible set of the complex moment relaxation of or¬ 
der d is a non-empty compact set and so is its image by A (defined in (2.3)). We 
can thus apply Trnovska’s result [71] which states that in a semidefinite program in 
real numbers, if the primal feasible set is non-empty and compact, then there exists 
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inf z gC" f(z) 
s.t. gi(z) 3? 0 

Complex 
Hierarchy 
y a ,p = f K z a z p dp 


Hankel property 


Va + /3 = 7 + S 


inf x eR" f{%) 
s.t. gi(x) ^ 0 

Real 

Hierarchy 
2/a = f K x a dp 


lll ^Va,p Ly(f) S.t. 

Md-kiigty) > 0 

2 /o,o = 1 


Hankel property 


yoi,p — y^,s 
Va + ft = 7 + S 


V 

ini y a L y(f) s.t. 

Md-ki (giy) > 0 

2/o = l 


Fig. 2. Commutativity of Relaxation and Hankel Property 


a dual interior point and there is no duality gap. Case 2: the feasible set of the 
complex moment relaxation of order d is empty, i.e. pd = +oo. It must be strongly 
infeasible because it cannot be weakly infeasible (see [27, Section 5.2] for definitions). 
Indeed, if it is weakly infeasible, then there exists a sequence (y 3 )jen of elements of 
n such that for all j <E N, we have \y J 0 0 - 1| < and A mi „(M rf _ fci {gty 3 )) ^ 
where i = 0, Define c := (n + d)\/{n\d\). We now mirnick the computa¬ 
tions in Lemma 3.6 using y J 0 0 ^ 1 + ^ 2 and |Tr(Mj_i(sj/ J '))| ^ ^ c 

if 1 SC l ^ d. Consider jo £ N such that for all j ^ jo and 1 ^ l ^ d, we 
have E|a|<i-i,|-y|=i 2 / 7 +a, 7 +« “ Eo<|a|^z 2/a,a > “l-The concluding equation in the 

proof of Lemma 3.6 then becomes E| a |^z 2/a,a < 2 + R 2 (E|a|<z-i 2/a,a) + c + 1. 
As a result, Tr(Md(y J )) = E|a|^<i2/a,a < ( 3 + c ) Ef= o r2 \ which, together with 
Amin {Md{y J )) 3* - > -I, yields A max (M d (y J )) < (3 + c) £)f =0 i? 2Z + c - 1. Hence 

for all j A jo, the spectrum of Md{y 3 ) is lower bounded by —1 and upper bounded 
by B d := (3 + c) R 21 + c - 1 ^ 1. We therefore have ^E|a|,|^|<d Wo.,?? < 
Cd ||M d (j/)|| < Cd x The sequence {y 3 )j^j 0 is thus included in a compact set. 
Hence there exists a subsequence that converges towards a limit y llm which satisfies 
2 /0™ = 1 an d the constraints \ m m(Md-ki(giy hnl )) 0, i = 0,... , m. Therefore y llm 
is a feasible point of the complex moment relaxation of order d , which is a contra¬ 
diction. Strong infeasibility means that the dual feasible set contains an improving 
ray [27, Definition 5.2.2]. Moreover, inf y e'H d L y (f) subject to yo,o = 1, Md(y) 

0, and Md-i(sy) = 0 is a semidefinite program with a non-empty compact feasible 
set hence the dual feasible set contains a point (A, cto Wi)- As result (A, coj 0 T, 0,..., 0) 
is a feasible point of the complex sum-of-squares relaxation of order d. Together with 
the improving ray, this means that p* d = +oo. To conclude, p* d = pd in both cases. □ 

Proposition 3.11. Assume that (3.3) satisfies (3.15) and has a global solution 
z opt g A' opt . In addition, assume that (crg pt ,..., cr^) £ £[z] rra+1 is an optimal 
solution to the sum-of-squares problem (3.12). Then (z opt ,cr° pt ,... ,<7^*) is a saddle 
point of ip : C n x £[z] m —> R defined by ( z,a) i —> f(z ) — E™i a i( z )gi{ z )- 

Proof. The optimality of (crg Pt ,..., cr° pt ) means that /—/ opt = EEo With 

/(z opt ) - / opt = Et=o <7° pt (z opt )3 , j(z opt ) = 0, cr° pt (z opt ) 3? 0, and gfiz opt ) 3? 0, we 
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have a° p \z opt )gi(z opt ) = 0 for i = 0,...,to. It follows that </>(z opt , cr) < <j>{z° p \ cr opt ) 
for all cr G H[z]. For all 2 G C n , <j)(z opt , a opt ) < <^(,z,cr opt ) because /(z) - / opt - 

E”= l = o"o Pt (^) > □ 

Given an application : C n —> K, define dp : R 2ra —> R by (x, y) \—> ip(x + i y). 
If dp is R-differentiable at point (x,y) G R 2n , consider the Wirtinger derivative [75] 
defined by Vip(x + i y) := \{V x dp(x, y ) - i V y dp(x, y)) G C". 

Corollary 3.12. With the same assumptions as in Proposition 3.11, we have 


(3.27) 


V/(z opt ) = ^™ lt T° pt (z opt )V ffl (z opt ), 
a op \z opt ), gi (z° pt )>0, i = 1,..., m. 
a° pt {z opt )g z {z opt ) = 0, i = 


Proof. z° pt is a minimizer of z G C ra i—*■ 0(^r, cr opt ) thus V z </>(z opt , a opt ) = 
V/(z opt ) - J2Zi Va° pt (z opt )gi (z opt ) - YZU o-° pt ( 2 opt )V 5 i(jz opt ) = 0. Consider 1 < 
i < m. Since cr° pt (z opt ) = 0 and cr° pt G E[z], it must be that |zfc — z£ pt | 2 divides 
< pt : Zk £ C i ► < pt (^ pt , ■ • •, ^° Pt i, . 4 pt )- With z opt =: x° pt + iy° p \ 

the real number x^ pt is a root of multiplicity 2 of Xk G R i —> <r° P k (xk+iyk'*)’ with an 
analogous remark for (/£ pt . Thus Vcr° pt (.z opt ) = 0 which leads to the desired result. □ 


sup A6M A 
s.t. Vz G K, 
f(z) — A > 0 


Complex 


Hierarchy 


Identify 
real and 


imaginary 

parts 


sup A6M A 
s.t. Mx + iy G K, 
f{x + iy) - A ^ 0 

Real Hierarchy 



m ( ri 

E E 

i=0 \i=i 


Identify 
real and 


suPa 6 kA s.t. 

Mx,y GR”, f(x+iy) — \ = 


|a+/3| ^.d—ki 


9i{x + iy) 


imaginary 

parts 




SUPAgH A S.t. 


Vs, y G R", f(x + iy) 

- A = 

( r 


2 \ 

E E 

i=0 V = 1 

E p)A x + ’ i y) a 

\a\^d—ki 

1 9i{x + iy) 


Fig. 3. Comparison of Real and Complex Hierarchies 

3.4. Comparison of Real and Complex Hierarchies. In Figure 3 where 
P),a’P % j,a,i3 ^ C, the real sum-of-squares hierarchy is artificially written using squares 
of moduli of complex polynomials. It thus yields bounds superior or equal to the 
complex hierarchy. For example, at order 2, the real hierarchy yields 1.0000 while the 
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complex hierarchy yields 0.6813 for (3.19). However, the size of the largest semidefinite 
constraint in the complex hierarchy when converted to real numbers, i.e. 2 x card{a G 
N" s.t. |a| ^ d} = 2 (n + d)\/(n\d\), is far inferior to that of the real hierarchy, i.e. 
cardja, /? G N n s.t. \a + j3\ < d} = (2 n + d)\/((2n)\d\). At fixed d, the size reduction 
converges towards 2 d ~ 1 as n —>• oo. Further reduction is possible (Section 3.5). 

3.5. Invariant Hierarchy. We generalize and transpose to complex numbers 
the work in [62] (see also [19]). Let (G, x) denote a compact group whose unit we 
denote 1. First, consider the continuous action of G on C” via A : G x C" —> <C n 
such that A(l,z) = z, A(g i x </ 2 ,z) = A(gi, A(g 2 , z)) for all z £ C n and gi,g 2 £ G. 
Second, consider the action of G on R[z, z] via A! : G x R[z, z] —> R[z, z] defined 
by A'(g,ip) := ip(A{g,z)). Third, consider the action of G on the set B(K) of Borel 
subsets of K via A" : G x B(K) — B(K ) defined by A"(g, B) := {z £ I\ \ A(g , z) £ 
B}. Last, consider the action of G on M(K) via A!" : G x Ai(K) —> M(K) defined 
by A'"(g,fJ,)(-) := y(A"(g, •)). Given a set S on which G is acting via T and Y C S, 
let Y g := {y £Y \ Vg £ G, T(g,y) = y}. If f,gi, ■ ■ ■ ,g m £ ®[z,z] G , then: 

(3.28) / opt = inf fj,e M ( K )a f K fdy s.t. f K dp, = 1 & n > 0. 

If /i is feasible for (3.28), then f K \p\ 2 gidp ^ 0 for all d £ N and p £ C<j[z] such that 

|p| 2 G Yid[z] G . Given A £ Bd, let A ^- G 0 therefore denote p H Ap ^ 0 for all p G Cd[z] 
such that |p| 2 G Yd[z] G . This yields a G-invariant hierarchy for all d ^ d min 

Pd ■= inf y eH d L v (f) 

(3.29) s.t. ?/o,o = 1, 

Md-kt ( giy ) > G 0, * = 0,..., m, 


( Pd )* : = su Pa ,<r A 

(3.30) s.t. / - A = Ya =o o-ifli. 

AgI, <Ji £ Yd-ki [z] G , i = 0,...,m, 

whose convergence we now discuss. Assume that the first 2e (e G N*) constraint func¬ 
tions gi, ■ ■■ ,g m form equality constraints (i.e. (/ 2 i-i = —gn =: hi, i = 1... e). Define 
S := Y[z] G + J2Ui z ] G hi (and S := S[z] G if there are no equality constraints). 

Proposition 3.13. Assume that f,gi,---,g m G K[z, z] G and that R[z, z] G = 
S + R. If f\K > 0, then there exists tr 0 ,..., a m £ E[z] G such that f = X)™ 0 cq</j. 

Proof. By definition of A ', R[z, z] G is an R-algebra. As a result, S' is a semiring of 
R[z, z] G (i.e. contains R + and is closed in R[z, z) G under taking sums and products) 
and M := S + J2k=2e+i 2[z] G <7i i s an S-module of R[z, z] G (i.e. contains 1 and satisfies 
M + M C M and SM C M). The conclusion then follows from [60, Theorem 2.6]. □ 
Proposition 3.14. The torus G = T in C with the action A(g , z) := gz satisfies 
Pd = Pd and ( p T d Y = p* d for all d > d mm if /, g u ..., g m £ R[z,z] T . 

Proof. Firstly, ip £ R[z, z] T if and only if Va, /3 £ N", \a — j3\tp a ,p = 0. Indeed, for 
all 9 £ R and z G C n , p(e w z) = E Q ,/ 3 eN" P»,p{e w zf {e ie z) 0 = XX )/3eN „ <p a ,pe i (M-\ a \'> e 
z a z 0 is equal to ip(z) if and only if tp a> p = 0 or \/3 — a\6 = 0[27r] (i.e. |/3 — 
a\ = 0). Secondly, if a £ E[z], i.e. cr = bil 2 > then I]| a |=|/ 3 | <x a ,pz a z 0 = 

E; e iEjIi IE| a |=;Pha^ a | 2 G S[z] T . Thirdly, if (A, <r 0 ,..., a m ) is feasible for (3.11), 
then (A, X)|a|=|/3| cr o,a,/3Z Q z /3 ,...,X)|a|=|/3| fT ™,a,/3^ a2:/3 ) is feasible for (3.30). Thus 
(p’d.)* = Pd- Lastly, if y is feasible for (3.29), then (y a ,p6 | a |=|/ 3 |)|a|,|/ 3 |<rf is feasible 
for (3.9) (where 6 is the Kronecker symbol). Hence pj = Pd- d 
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If /, ( 71 ,..., g m £ R[z, z] t , then the minimum order d mm of the complex hierarchy, 
i.e. max{|a|, |/3| s.t. \f a ,p\ + \gi, a ,p\ + • ■ • + \g m ,a,p\ ^ 0 }: is equal to that of the real 
hierarchy, i.e. max{[(|a| + |/3|)/2] s.t. \f a ,p\ + \gi, a ,p\ + • ■ ■ + \gm,a,p\ ^ 0}, where [.] 
denotes the ceiling of a real number. 

Proposition 3.15. The subgroup G = {—1,1} of T with A(g,z) := gz satisfies 
p { d ~ 1,1} = Pd and (p { d ~ 1,1} )* = p* d for all d > d mm if f,gi, ...,g m £ K[z, 2 ] {_1,1} - 

Proof. Firstly, ip £ K[z, if and only if V|a + /3| odd, ip a ^ = 0. Secondly, if 

a G E[z], i.e. a = Ej=l \pj | 2 , then E|a +; S|even <7 a ,0Z a Z^ = Ej=l I £|a|evenPj >*°| 2 + 
IE|«|odd Pj,o> z0l \ 2 G S[z] {_1 ' 1} . Thirdly, if (A, cr 0 ,..., cr m ) is feasible for (3.11), then 
( A J E| a + / 3 |even cr o,a,/ 3 ^“^ / 3 ,---,E|a+/ 3 |even cr ™,«,/ 3 ^“ 2 /3 ) is feasible for (3.30). Lastly, if 
y is feasible for (3.29), then (y a ,; 8 <S|a+/ 9 |even)|a|,|/ 9 |^d is feasible for (3.9). □ 

A problem with T-invariance in complex numbers converts in real numbers to a 
problem with {—1, l}-invariance. If a £ £d[z] T , then (< 7 a,/ 9 )|a|,|/ 3 |<d has a (d + 1)- 
block-diagonal structure, whereas if er £ T, d [x, y]i _1,1 i with z =: x + i y, then it has a 
2 -block-diagonal structure (after permutation) whose 2 blocks are much bigger. 

3.6. Multi-Ordered Relaxation. We generalize and transpose to complex 
numbers the work in [53]. The idea is to associate a relaxation order to each con¬ 
straint. In addition, we consider the coupling of the variables induced by the monomi¬ 
als present in the optimization problem, to which we add the coupling of the variables 
induced by only some constraints (those with a “high-order”). For instance, the cou¬ 
pling induced by the mononials in g\{z\, Z 2 , z 3 ) := $i{zi(z 2 +Z 3 )} ^ 0 is {( 1 , 2 ), ( 2 ,3)}, 
while the coupling induced by the constraint is {(1,2), (1,3), (2,3)}. 

Given a £ N ra with n > 1, let supp(a) := {1 < s ^ n | a s / 0}. Consider 
the coupling induced by monomials defined by £ mono ~ {(/,m) | l ^ to s.t. 3a, /3 £ 
N” s.t. {I, to} C supp(a + /?) and \f a ,p\ + \gi,a,p\ + + \gm, a ,p\ 0}. Given 

I C {l,...,n}, let z(I) := {zi | * £ 1} if I ^ 0, else z(C) := 1. Given y £ Ti, 

d £ N, and p £ K[z(7), z(/)], let M d (py,I) := ( M d (py)(a,/3) ) S upp(a),supp(/ 3 )c/■ 

Let Gi, ..., G m C {1,..., n} denote the minimal sets in terms of inclusion such that 
{gi,---,g m ) G Mfc 1 [z(Gi),z(Gi)] x ... x R km [z(G m ), z(G m )]. 

Let (di,... , dm) £ N m be such that di — ki ^ 0 for all 1 < i < m. Consider 
the coupling induced by monomials and high-order constraints defined by £ con := 
^mono u (J di>fci {(/, m) \ l ^ m s.t. { l , to} C Gi}. Let Gi,..., C p C {1,..., to} denote 
the maximal cliques of a chordal extension of ({1,..., n}, £ con ). Given 1 ^ i ^ to, let 
h := U i &L Ci where L £ argmin{E; e i |C*| | Gi C U i eL Ci, L C {1,... ,to} }. For i 
such that di > ki, L is a singleton due to the definition of £ con . Define (dj 1 , • • •, d }}) £ 
N p such that df := min{di,..., d m } if Ci ^ Ii for all 1 < i < to; if not, let df := 
max{dj | h = Ci}. Define the relaxation of order (di,..., d m )by 

Pdi,...,d m := fof y^'H L y (f) 

(3 311 s - t ' yo ’° = 

1 j M d c.i(y, Ci) 0 , l = l,...,p, 

M di - ki (giy,Ii) >?= 0 , * = 1 ,.• •,to, 


Pdu-,d m : = su Pa,ct 

A 

s.t. 

f — ^ = Ef=i ( a oj + EciC/i ’ 


<7o,1 G £ d oi [z(Ci)], l = l,...,p, 


di £ T, di - ki [z(Ii)], i = l,...,m. 


(3.32) 
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3.7. Multi-Ordered Hierarchy. Given 77 : N —> [fci,+oo[x ... x [A; m ,+oo[ 
such that min,i_>.+oo 77(d) = + 00 , consider the sequence indexed by d £ N of re¬ 
laxations of order 77(d). We refer to such a sequence as multi-ordered hierarchy. 
The uniform case where H(d) := (d + d mm ,... ,d + d mm ) is a special case of the 
sparse real hierarchy of [74] when transposed to complex numbers. The hierarchy 
of [74] converges to the global value of a real polynomial optimization problem if a 
ball constraint is added for each clique of a chordal extension of the sparsity pat¬ 
tern [42, equation (2.29)]. The same holds in the complex case if a slack variable and 
redundant sphere constraint is added for each clique. The proof is the same as in 
the real case [42, Lemma B.13 and 4.10.2 Proof of Theorem 4.7] once the real vector 
spaces on which measures are defined are replaced by complex vector spaces. (For 
other proofs of [42, Theorems 2.28 and 4.7], see [31] and [40].) For d £ N great enough, 

i.e. once di > ki for all 1 ^ i ^ m, the relaxation of order 77(d) = (d\ .... ,d m ) is at 
least as tight as the complex sparse relaxation of [74] of order min 77(d). Any multi- 
ordered hierarchy thus globally converges (if a slack variable and sphere constraint is 
added for each clique). 

3.8. Example of Multi-Ordered Hierarchy: the Mismatch Hierarchy. 

Conceptually, the mismatch hierarchy is defined by the following procedure. Until a 
measure can be extracted from a solution y to the multi-ordered relaxation, 

1 . compute a solution y to the moment relaxation of order (di,..., d m ); 

2 . find a closest measure y to y not necessarily supported on I\ : 


(3.33) 


arg min 
/i Dirac 


da,/3 


L 


z a z^dy 


H,101=1 


F 


3. increment di = di + 1 at the highest mismatch, that is to say: 


(3.34) 


arg max 


'y ] 9i,a,P ( Va.f 3 

OL,P ' 


z a z 0 d 



Stricly speaking, we refer to the mismatch hierarchy as the following recursively de¬ 
fined multi-ordered hierarchy 77. It depends on 3 parameters: a mismatch tolerance 
e > 0 ; the number h £ N* of highest mismatches considered at each iteration; and 
an upper bound on the difference between maximum and minimum orders, i.e. 

{maxF(d) — min 77(d) | d £ N}. 

Initialize by 77(0) := k\ x ... x k m and let’s define 77(d + 1) in function of 
77(d). We distinguish two cases. Case 1: if there exists no solution to the moment 
relaxation of order 77(d), then let 77(d + 1) := 77(d) + (1,..., 1). Case 2: if not, 
consider a solution y. For 1 ^ l ^ p, consider some complex numbers ( u(l)j)jeCi 

such that u(l) u{l) H is the closest rank 1 matrix to y(l) := (da,/ 3 )l“pp(t l ) 7 S upp( / 3 )cC 1 
with respect to the Frobenius norm. Let Xi{l) ^ A 2 (Z) ^ 0 respectively denote 
the first and second largest eigenvalues of y(l). Let 9 £ R p be a minimizer of 
Ef, m =i Ejec l nc m ( ai su(l)j + 61 — arg u{m) 3 - 6 m ) 2 s.t. 9 £ [0,2tt]p. Let z G C" be 
a minimizer of Ea 2 ( 0 ^o Xi(l)/\ 2 (l)\\z(Ci) - u(l)e l9l \\l + 2max{Ai(Z)/A 2 (Z) | X 2 (l) ^ 
0} x Ea 2 (,)=o II z (Ci) — u(l)e l61 H 2 - We distinguish 3 cases: 

• Case 2 . 1 : M. := {1 ^ i ^ m \ \L y {gi) — gi{z)\ > e and 77,;(d) < max77(d)} 7 ^ 0 

• Case 2.2: M = 0 and M' := {* G S \ \L y (gi) — gi(z)\ > e} 7 ^ 0 

• Case 2.3: M = M' = 0 
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In Case 2.1, let Hj(d + 1) := Hj(d) + 1 if Ij C I* and i has one of the h highest 
mismatches \L y (g i ) — gi(z)\ among i £ M. For all other 1 ^ j ^ to, let Hj(d + 1) := 
Hj(d) unless the bound A™f x is violated, in which case for all 1 < j < m such that 
Hj(d) = min H(d), let Hj(d + 1) := Hj(d) + 1. In Case 2.2, apply instructions of 
Case 2.1 where M is replaced by M !. In Case 2.3, let H(d + 1) := H(d) + (1,..., 1). 
Observe that min H(d) > max H (d) — A™f x —> +oo as d —> +oo. 

4. Application to Electric Power Systems. The optimal power How is a 
central problem in power systems introduced half a century ago in [14]. While many 
non-linear methods [16,77] have been developed to solve this difficult problem, there 
is a strong motivation for producing more reliable tools. Since 2006, the ability of the 
Shor and second-order conic relaxations to find global solutions [3,7,20,34,47,52,69] 
has been studied. Some relaxations are presented in real numbers [43,54] and some in 
complex numbers [9,10,76]. However, in all numerical applications, standard solvers 
such as SeDuMi, SDPT3, and MOSEK are used which currently handle only real 
numbers. Modeling languages such as YALMIP and CVX do handle inputs in complex 
numbers, but the data is transformed into real numbers before calling the solver [11, 
Example 4.42]. We use the European network to illustrate that it is beneficial to relax 
non-convex constraints before converting from complex to real numbers. 

4.1. Optimal Power Flow. A transmission network can be modeled using an 
undirected graph Q = (B, C) where buses B = {1,.. .n} are linked to one another via 
lines C C B x B. Power flows are governed by the admittance matrix Y £ (C” xn w i 10se 
extra diagonal terms ( l,m ) £ C are equal to yim/{pmipj^ z ) and whose diagonal terms 

(1,1) are equal to X(/,m) G ,c(^ m + yfD/\Pi™\ 2 - All others terms are equal to zero. 
Here, yi m £ C denotes the mutual admittance between buses ( l,m ) £ C, yf/ n £ C 
denotes the admittance-to-ground at end l of line ( l,m ) £ C , and pi m £ C denotes 
the ratio of the ideal phase-shifting transformer at end l of line (l, to) £ C. 

Each bus injects power p| en +ig® en into the network with capacity limits P™ m , P™ ax , 
r !™ aX (potentially all equal to 0) and extracts power demand p^ em + i q^ em from 
the network. Each bus operates at a voltage Vk £ C. Finding power flows that 
minimize active power loss is a problem that can be cast as an instance of QCQP-C: 


y H i y 

(4.1) inf v H - —v 

v ' vec n 2 

(4.2) s.t. Vfc£0, pt n -p d k em ^v H H k v^pr x -p d k em , 

(4.3) VA: £ B, qf n - ^ v H H k v < q^ x ~ fl£ em , 

(4.4) Vfc £ B, (uf n ) 2 < v H e k e T k v < (v^) 2 , 


where H k := 


Y u e k ej+e k ej Y 


and H k := 


Y H e k ej-e k ejY 


are Hermitian and e k is the 


2 K • 2i 

k th column of the identity matrix. In Section 4.2, power flows are computed that seek 


to minimize either power loss or generation costs a k(p k en ) 2 + ^kP k en + c k where 

a k , b k , c k £ R, a k ^ 0, and pf en = v H H k v+p k em . In the case of generation costs, new 
real variables (t k ) k ^B are introduced, objective (4.1) is replaced by XfceB an< ^ new 
constraints are added for all k £ B: a k (v H H k v + p k em ) 2 + b k (v H H k v +p)J em ) + c k < 
t k . In Section 4.2 apparent power flow limits < s^ x are enforced where 


wfin = v H Fi m v and := aj^eief + 6^e m ef, with ai m ■= (yim + yf m )/\pir 


blm '•— Vim/(PmlPi 


H 


). These can be written for all ( l,m ) £ £: (v 


HFlrn+K 


| 2 and 
-v) 2 + 


(v* 


2i 


v ) 2 ^ (Shi/*) 2 - Note that generation cost and line flow constraints yield 
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second-order conic constraints for all the relaxations considered in this paper as well as 
semidefinite constraints for higher orders of the nroment/sum-of-squares hierarchies. 
The optimal power flow problem is invariant under the action of the torus (Section 3.5) 
due to alternating current. We thus implement invariant hierarchies in Section 4.2.3. 

4.2. Numerical Results. We consider large test cases representing portions of 
European power systems: Great Britain (GB) [72], Poland (PL) [77], and systems 
from the PEGASE project [28,37]. They were preprocessed (see Table 1) to remove 
low-impedance lines in order to improve the solver’s numerical convergence, which 
is a typical procedure in power system analysis. A 1 x 10~ 3 per unit low-impedance 
line threshold was used for all test cases except for PEGASE-1354 and PEGASE-2869 
which use a 3 x ICG 3 per unit threshold. Table 1 includes the at-least-locally-optimal 
objective values obtained from the interior point solver in Matpower [77] for the 
problems after preprocessing. Note that the PEGASE systems specify generation 
costs that minimize active power losses, so the objective values in both columns 
are the same. Implementations use YALMIP 2015.06.26 [46], Mosek 7.1.0.28, and 
MATLAB 2013a on a computer with a quad-core 2.70 GHz processor and 16 GB of 
RAM. The results do not include the typically small formulation times. 

Table 1 

Size of Data (After Low-Impedance Line Preprocessing) 


Test 

Case 

Name 

Number of 
Complex 
Variables 

Number of 
Edges 
in Graph 

Matpower Solution [77] 

Gen. Cost 
($/hr) 

Loss Min. 
(MW) 

GB-2224 

2,053 

2,581 

1,942,260 

60,614 

PL-2383wp 

2,177 

2,651 

1,868,350 

24,991 

PL-2736sp 

2,182 

2,675 

1,307,859 

18,336 

PL-2737sop 

2,183 

2,675 

777,617 

11,397 

PL-2746wop 

2,189 

2,708 

1,208,257 

19,212 

PL-2746wp 

2,192 

2,686 

1,631,737 

25,269 

PL-3012wp 

2,292 

2,805 

2,592,462 

27,646 

PL-3120sp 

2,314 

2,835 

2,142,720 

21,513 

PEGASE-89 

70 

185 

5,819 

5,819 

PEGASE-1354 

983 

1,526 

74,043 

74,043 

PEGASE-2869 

2,120 

3,487 

133,945 

133,945 

PEGASE-9241 

7,154 

12,292 

315,749 

315,749 

PEGASE-9241R 7 

7,154 

12,292 

315,785 

315,785 


4.2.1. Shor Relaxation. Table 2 shows the results of applying SDP-R. and 
SDP-C. They yield global decision variables and the global objective value for the 
cases marked an asterisk (*) in Table 2. For those cases, the eigenvector associated to 
the largest eigenvalue is feasible up to 0.005 p.u. at voltage constraints and 1 MVA at 
all other constraints, and the objective evaluated in the eigenvector matches the bound 
within 0.05% relative to the bound. The lower bounds in Table 2 suggest that the 
corresponding Matpower solutions in Table 1 are at least very close to being globally 
optimal. The gap between the Matpower solutions and the lower bounds from 
SDP-C for the generation cost minimizing problems are less than 0.72% for GB-2224, 

' PEGASE-9241 contains negative resistances to account for generators at lower voltage levels. 
In PEGASE-9241R these are set to 0. 
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0.29% for the Polish systems, and 0.02% for the PEGASE systems with the exception 
of PEGASE-9241. The non-physical negative resistances in PEGASE-9241 result in 
weaker lower bounds, yielding a gap of 1.64%. In accordance with Appendices B 

Table 2 

Real and Complex SDP (Generation Cost Minimization) 


Case 

Name 

SDP-R 

SDP-C 

Val. ($/hr) 

Time (sec) 

Val. ($/hr) 

Time (sec) 

GB-2224 

1,928,194 

10.9 

1,928,444 

6.2 

PL-2383wp 

1,862,979 

48.1 

1,862,985 

23.0 

PL-2736sp* 

1,307,749 

35.7 

1,307,764 

22.0 

PL-2737sop* 

777,505 

41.7 

777,539 

19.5 

PL-2746wop* 

1,208,168 

51.1 

1,208,182 

22.8 

PL-2746wp 

1,631,589 

43.8 

1,631,655 

20.0 

PL-3012wp 

2,588,249 

52.8 

2,588,259 

24.3 

PL-3120sp 

2,140,568 

64.4 

2,140,605 

25.5 

PEGASE-89* 

5,819 

1.5 

5,819 

0.9 

PEGASE-1354 

74,035 

11.2 

74,035 

5.6 

PEGASE-2869 

133,936 

38.2 

133,936 

20.6 

PEGASE-9241 

310,658 

369.7 

310,662 

136.1 

PEGASE-9241R 

315,848 

317.2 

315,731 

95.9 


and C, all objective values in Table 2 match within 0.037%. SDP-C is faster (between 
a factor of 1.60 and 3.31) than SDP-R. Exploiting the isomorphic structure of complex 
matrices in SDP-C is thus better than eliminating a row and column in SDP-R. 

4.2.2. Second-Order Conic Relaxation. Table 3 shows the results of apply¬ 
ing SOCP-R and SOCP-C. Unlike the Shor relaxation, they do not yield the global 
solution to any of the test cases. 8 SOCP-C provides better lower bounds and is faster 
than SOCP-R. Lower bounds from SOCP-C are between 0.87% and 3.96% larger and 
solver times are faster by between a factor of 1.24 and 6.76 than those from SOCP-R. 

4.2.3. Moment/Sum-of-Squares Hierarchy. The real hierarchy globally solves 
a broad class of optimal power flow problems [30,39,51,53] by first converting them 
to real numbers. The dense real and complex hierarchies solve problems up to 10 
buses while the sparse ones solve problems with up 40 buses. In order to solve large- 
scale instances, we apply the mismatch hierarchy of Section 3.8 with the following 
parameters: e := 1 MVA; h, := 2; and A™f x := 2. See Appendix E for a small ex¬ 
ample. The mismatches are taken to be the modulus of the complex number whose 
real part is the mismatch for constraint k in (4.2) and whose imaginary part is the 
mismatch for constraint k in (4.3). In other words, apparent power mismatches are 
considered rather than active and reactive power seperately. To improve numerics, 
\y a ,p + yaCtd\ ^ 2(u max ) a+ ^ and \y a ,p — VcTp\ ^ 2(u max )“ +/3 are added to the complex 
hierarchy and \y a \ ^ (.y max ) Q j s added to the real hierarchy for all |cc|, |/3| ma xH(d) 
where n max := (u” lax ,..., v™ ax ) (see (4.4)). A similar procedure can be found in [74]. 

In Tables 4 and 5, the mismatch hierarchy is applied until the solution obtained 
is feasible up to 0.005 p.u. at voltage constraints and 1 MVA at all other constraints 9 , 

8 SOCP-C generally does not provide a global solution with the exception of radial systems when 
certain non-trivial technical conditions are satisfied [47]. 

^Typical violations are smaller than 1 MVA. For instance, with the complex hierarchy PL-3012wp 
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Table 3 

Real and Complex SO CP (Generation Cost Minimization) 


Case 

Name 

SOCP-R 

SOCP-C 

Val. ($/hr) 

Time (sec) 

Val. ($/hr) 

Time (sec) 

GB-2224 

1,855,393 

3.5 

1,925,723 

1.4 

PL-2383wp 

1,776,726 

8.5 

1,849,906 

2.4 

PL-2736sp 

1,278,926 

4.8 

1,303,958 

1.7 

PL-2737sop 

765,184 

5.5 

775,672 

1.6 

PL-2746wop 

1,180,352 

5.1 

1,203,821 

1.7 

PL-2746wp 

1,586,226 

5.5 

1,626,418 

1.7 

PL-3012wp 

2,499,097 

5.9 

2,571,422 

2.0 

PL-3120sp 

2,080,418 

6.2 

2,131,258 

2.2 

PEGASE-89 

5,744 

0.5 

5,810 

0.4 

PEGASE-1354 

73,102 

3.4 

73,999 

1.5 

PEGASE-2869 

132,520 

9.0 

133,869 

2.7 

PEGASE-9241 

306,050 

35.3 

309,309 

10.0 

PEGASE-9241R 

312,682 

36.7 

315,411 

5.4 


and until the objective evaluated in the solution matches the bound within 0.05% 
relative to the bound. The optimal values in the two tables match to at least 0.007%, 
which is within the expected solver tolerance. Further, they match the optimal values 
for the loss minimizing problems in Table 1 to within 0.013%, further proving that they 
are globally optimal. However, local solvers do not always globally solve the optimal 
power flow [12,15,50,53]. Though both hierarchies solve many small- and medium-size 
test cases which minimize generation cost, the mismatch hierarchy requires too many 
higher-order constraints for larger generation-cost-minimizing test cases. 

The feasible set of the optimal power flow problem is included in the ball of 
radius ( y] k ax ) 2 so a slack variable and a sphere constraint may be added as 

suggested in Section 3.3. In order to preserve sparsity, a slack variable and a sphere 
constraint may be added for each maximal clique of the chordal extension of the 
network graph. However, it tends to introduce numerical convergence challenges in 
problems with several thousand buses, resulting in higher-order constraints at more 
buses and correspondingly longer solver times. Interestingly, the results in Table 5 
were obtained without the slack variables and sphere constraints. A potential way 
to account for this would be to compute the Hermitian complexity [26] of the ideal 
generated by the polynomials associated with equality constraints. A step in that 
direction would be to assess the greatest number of distinct points (possibly infinite) 
v l £ C", 1 ^ i < p, such that (v’) H (Hk + i Hk)v^ = —— ig)! em for all buses k not 
connected to a generator and for all 1 ^ i, j ^ p. The Hermitian complexity of the 
ideal generated by Y^i =l \ z i\ 2 + a ( z ) + o, as defined in (3.15) with a < 0 is equal to 1. 

Tables 4 and 5 show that the complex hierarchy has advantages over the real 
hierarchy. In all cases except PEGASE-1354, there is a speedup factor in solver time 
of between 1.31 and 21.42. The most significant improvements are seen for cases 
(e.g., PL-2383wp and PL-2746wop whose biggest maximal clique has 19 nodes) where 
the higher-order constraints account for a large portion of the solver times. This is 
due to fewer terms in the higher-order constraints. There is also a speedup in solver 


has over 99% of the buses with less than 0.02 MVA violation, and only 0.09% of the buses with greater 
than 0.1 MVA violation. Maximum line flow viotation is 0.0006 MVA. 
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Table 4 

Real Moment/Sum-of-Squares Hierarchy (Active Power Loss Minimization) 


Case 

Name 

Num. 

Iter. 

Global Obj. 
Val. (MW) 

Max. Viol. 
(MVA) 

Solver 
Time (sec) 

PL-2383wp 

3 

24,990 

0.25 

583.4 

PL-2736sp 

1 

18,334 

0.39 

44.0 

PL-2737sop 

1 

11,397 

0.45 

52.4 

PL-2746wop 

2 

19,210 

0.28 

2,662.4 

PL-2746wp 

1 

25,267 

0.40 

45.9 

PL-3012wp 

5 

27,642 

1.00 

318.7 

PL-3120sp 

7 

21,512 

0.77 

386.6 

PEGASE-1354 

5 

74,043 

0.85 

406.9 

PEGASE-2869 

6 

133,944 

0.63 

921.3 


Table 5 

Complex Moment/Sum-of-Squares Hierarchy (Active Power Loss Minimization) 


Case 

Name 

Num. 

Iter. 

Global Obj. 
Val. (MW) 

Max. Viol. 
(MVA) 

Solver 
Time (sec) 

PL-2383wp 

3 

24,991 

0.10 

53.9 

PL-2736sp 

1 

18,335 

0.11 

17.8 

PL-2737sop 

1 

11,397 

0.07 

25.7 

PL-2746wop 

2 

19,212 

0.12 

124.3 

PL-2746wp 

1 

25,269 

0.05 

18.5 

PL-3012wp 

7 

27,644 

0.91 

141.0 

PL-3120sp 

9 

21,512 

0.27 

193.9 

PEGASE-1354 

11 

74,042 

1.00 

1,132.6 

PEGASE-2869 

9 

133,939 

0.97 

700.8 


time of between 2.0 and 5.9 for 7 out of the 8 small- to moderate-size generation-cost¬ 
minimizing test cases in [53], the exception being case39Q due to numerical difficulties. 
For those 7 cases, the maximum violation for the complex hierarchy is 0.08 MVA, with 
the remaining case (casell8Q) having a maximum violation of 0.32 MVA. 

PL-3012wp, PL-3120sp, PEGASE-1354, and PEGASE-2869 require more iter¬ 
ations in the complex case than the real one. However, the improved speed per 
iteration results in faster overall solution times for all of these test cases except for 
PEGASE-1354, for which 6 additional iterations result in a factor of 2.78 slower solver 
time. Interestingly, the dense versions of the real and complex hierarchies yield the 
same bounds at each order for small test cases 10 buses) from [12,44,50,55]. 

5. Conclusion. We construct a complex moment/sum-of-squares hierarchy for 
complex polynomial optimization and prove convergence toward the global optimum. 
Theoretical and experimental evidence suggest that relaxing non-convex constraints 
before converting from complex to real numbers is better than doing the operations 
in the opposite order. We conclude with the question: is it possible to gain efficiency 
by transposing convex optimization algorithms from real to complex numbers? 
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Appendix A. Rank-2 Condition. It is proven here that a Hermitian matrix 
Z is positive semidefinite and has rank 1 if and only if A (Z) is positive semidefinite 
and has rank 2. 

(=>) Say Z = zz H where real and imaginary parts are defined by £ = x\ + ' 1 x 2 
and ( Xi,X 2 ) ^ (0,0). Then 

(A.la) A (Z) = ( XlX l + X2X l XlX l ~ X2X l ) 

\ % 2%1 — X\X 2 X\X{ + X2X2 J 

The rank of A (Z) is equal to 2 since ( xf x 2 ) T and ( {—X 2 ) T xj ) T are non-zero 
orthogonal vectors. 

(<S=) Say A (Z) = xx T + yy T where x and y are non-zero real vectors. Consider 
the block structure x = ( xf x 2 ) T and y = ( yj y 2 ) T ■ For i = 1,..., n, it must be 
that 

(A.2a) x\i + y\i = x 2 2i + y\, 

(A.2b) x u X2i + yuV2% = 0. 


Two cases can occur. The first is that xuX2i 0 in which case there exists a real 
number \ ^ 0 such that 

(A.3) ( Vli = X2U 

[ V 2 i = JT Xu. 

Equation (A.2a) implies that (1 — X 2 )x\ t = (1 — jz)x 2 i thus (1 — A?)(l — y?) ^ 0 
and Xi = ±1. The second case is that XuX 2 i = 0. Then, according to (A.2b), 
yuV 2 i = 0. If either xu = yu = 0 or X 2 i = y 2 i = 0, then (A.2a) implies that 
xu = X2i = yu = y2i = 0 . If Xu = y2i = o, then (A.2a) implies that yu = ±X2i- If 
X2i = yu = 0, then (A.2a) implies that y 2 i = ±Xu- 
In any case, there exists e, = ±1 such that 


(A.4) 


f Vli — % 2 ii 

\ 1/2i = €i %li- 


For i, j = 1,..., n it must be that 


(A.5a) 

(A.5b) 

Moreover 

(A.6) 


(1 - ei€j)(xuXij - X2iX 2 j) = 0 , 
(1 - eiCj)(xijX2i + XuX2j ) = 0. 


f “ 1 “ VliVlj — H - ^i€jX 2 i% 2 j ? 

%li%2j H - y\%y2j — %li%2j €i€j%2i%lj • 


It will now be shown that 


(A.7) 


H - ynyij 
%ii%2j H - yny2j 


H - %2i%2j 5 
3'li%2j •X'2i%lj • 
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It is obvious if e^ej = 1. If e *Cj = —1, then (A.5a)-(A.5b) imply 

(A.8a) xuxij — x 2 iX 2 j = 0, 

(A.8b) xijX2i + xuX2j = 0. 

If xuXijX 2 iX 2 j = 0, it can be seen that (A.7) holds. If not, (A.8a) implies that there 
exists a real number puj ^ 0 such that 


(A.9) 


x *i = In, 


Further, (A.8b) implies that (/Mj + -jj—)x\jX 2 i = 0. This is impossible (p,ij + 777 ^ 0 
and XijX 2 i ^ 0). Thus, (A.7) holds. 

With the left hand side corresponding to A (Z) = xx T + yy T and the right hand 
side corresponding to (A.lb), equation (A.7) implies that A (Z) is equal to (A.lb). 
Since the function A is injective, it must be that Z = (aq + iaq^aq + 1x2) H ■ 

Appendix B. Invariance of Shor Relaxation Bound. We have val(CSDP-R) 
^ val(SDP-R) since the feasible set is more tightly constrained due to (2.4d). To prove 
the opposite inequality, define A(X) := (A + C)/2 + i (B — B T )/2 for all X G 82™ 
using the block decomposition in the left hand part of (2.4d). It is proven here 
that if A is a feasible point of SDP-R, then A o A(X) is a feasible point of CSDP-R 
with same objective value as X. Firstly, A o A(X) satisfies (2.4d) because A(X) is a 
Hermitian matrix. Secondly, in order to show that Ao A(X) satisfies (2.4c), notice 
that if x = ( xf x x ) T then 



Hence A o A(X) is equal to the sum of two positive semidefinite matrices. Fi¬ 
nally, to prove that A o A(A) satisfies (2.4b) and has same objective value as X , 
notice that if H G H„ and Y G §2™, then Tr [A(H)Y] = Ei<jj;g2n ■M-fOdXi* = 
El^iq^2ra = + Im (H)ijBij + ( — llTi(H)ij)(B T )ij + 

Re(H)ijCij = Ei^ij^n Re(Hij)(A + C)ij + lm(Hij)(B — B T )ij = ... 

2Ei W<n Wi(AQOtf)*] = 2E(*(*%•)* = 2Tv[HA(Y)}. Complet¬ 
ing the proof, for all H € H„, Tr[A(R) A o A(A)] = 2Tr[RA(A)] = Tr [A (H)X). 

Appendix C. Invariance of SDP-K Relaxation Bound. We assume that X 
is a feasible point of SDP-K. and construct a feasible point of SDP-R with same objec¬ 
tive value and first diagonal entry equal to 0. Consider the eigenvalue decomposition 
X = ELi x k x k for some Xk G R 2 " and p G N. For all 9 G K, define 


(C.l) R, := A[c«(»)4 U-W/.1 = ( ) . 

For k = 1,... ,p, define 9 k G K such that x k , n +i + i£fc,i = : x kn+1 + x kl e l8k . Con¬ 
struct X := Y^k=i(R s k x k)(Re k x k) T 0 whose first diagonal entry is equal to 0. If 
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H G B n , Tr(A (H)X) = ELi Tr[A {H)R 9k x k x T k R^] = ELi TV[P£A(P)P efc x fc x£] = 
Efc=i Tr [ A {( cos ( 0 fc)/n - ism(9 k )I n )H(cos(6 k )I n + isin (d k )I n )}x k xl] = ... 
ELiTi'[A(77)x fe ^] = Tr(A(ff)X). 

Appendix D. Discrepancy Between Second-Order Conic Relaxation 
Bounds. We have val(CSOCP-R) ^ val(SOCP-R) since the feasible set is more 
tightly constrained. The opposite inequality between optimal values does not hold, 
and this can be proven by considering the example QCQP-C defined by inf 2lj22g c (1 + 
i)^i~2 + (1 — i)z2Zi s.t. Z\Z\ < 1 , Z2Z2 ^ 1 - CSOCP-R yields the globally optimal 
value of — 2\/2, while SOCP-R yields —4. 

Appendix E. Five-Bus Illustrative Example for Exploiting Sparsity. 

Consider the five-bus optimal power flow problem in [12] which is an instance of 
QCQP-C. Let ind(-) denote the set of indices corresponding to monomials of either 
the objective / or constraint functions We have 


ind(/) = {(1,1), (1,2), (1,3), (3,5), (4,5), (5,5)}, 
ind(si) =ind(g 2 ) = {( 1 , 1 ), ( 1 , 2 ), ( 1 ,3)} 

[PfEQfl , 

ind( 53 ) = ind( ff4 ) = {( 1 , 2 ), ( 2 , 2 ), ( 2 , 3 ), ( 2 , 4 )} 

[-^ 2 , Q 2 ] , 

ind(«? 5 ) = md( S6 ) = {(1, 3), (2,3), (3, 3), (3, 5)} 

[P-i-.Q-A: 

ind( 5 r) = ind (g 8 ) = {(2,4), (4,4), (4, 5)} 

[P 4 , Q4 , 

ind(g 9 ) = ind( 3 io) = {(3, 5), (4, 5), (5, 5)} 

[ pmin nminl 
1/5 > ^5 J J 

ind(gn) = ind(gi 2 ) = {( 1 , 1 )} 

[vr 11 , v 1 max ] , 

ind( 513 ) = ind(gi 4 ) = {( 2 , 2 )} 

rT/-min T/maxl 
[ v 2 5 v 2 ] J 

ind ( 515 ) = ind(gi 6 ) = {(3,3)} 

[T/min T/maxl 

L^3 > v 3 J ’ 

ind(gi 7 ) = ind(gi 8 ) = {(4,4)} 

fT/min T/maxl 

|_M » V 4 J 5 

ind (fog) = ind(g 2 o) = {(5, 5)} 

[T/min T/maxl 

l V 5 5 v 5 J 5 


where the text in brackets indicates the origin of the constraint: P* and Qi for active 
and reactive power injection equality constraints, P™ 11 and Q™ 111 for lower limits 
on active and reactive power injections, and T/ min and j/. max for squared voltage 
magnitude limits at bus i. For brevity, the sphere constraints discussed in Section 3.3 
are not enforced in this example. Regardless, the complex hierarchy with = 1, V* G 
{1,2,3,4,5,6,11,12,13,14,15,16}, d z = 2, Vi G {7,8,9,10,17,18,19,20}, yields the 
global solution. Second-order constraints are identified using the mismatch hierarchy. 

The graph ({1,..., 5}, £ mono ) corresponding to (E.l) is shown in Fig. 4 where 
each node i corresponds to a complex variable Zi. Edges £ mono , which are denoted 
by solid lines in Fig. 4, connect variables that appear in the same monomial in any of 
the constraint equations or objective function. The supergraph ({1,..., 5}, £ con ) has 
edges £ con comprised of £ mono (solid lines in Fig. 4) augmented with edges connecting 
all variables within each constraint with di > 1 (dashed lines in Fig. 4). In this case, 
the supergraph is already chordal, so there is no need to form a chordal extension 
Q ch . The maximal cliques of the supergraph are C\ = {1, 2, 3} and C 2 = {2, 3,4, 5}. 
Clique C 2 is the minimal covering clique for all second-order constraints (jj (z) ^ 
0, Vi G {7,8,9,10,17,18,19,20}. The order associated with C 2 is two (df = 2) 
since the highest order di among all constraints for which C 2 is the minimal covering 
clique is two. Clique Ci is not the minimal covering clique for any constraints with 
di > 1, so df = 1. The globally optimal objective value obtained from the complex 
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Fig. 4. Graph Corresponding to Equations (E.l) from Five-Bus System in [12] 


hierarchy specified above is 946.8 with corresponding decision variable 2 = (1.0467 + 
O.OOOOi, 0.9550 - 0.0578i, 0.9485 - 0.0533i, 0.7791 + 0.6011i, 0.7362 + 0.7487i) T . 
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